Klib
Easy to use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (+62.71%)
Data Science Live Book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (+63.56%)
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can itself be considered applied machine learning.
Stars: ✭ 218 (+84.75%)
Matplotplusplus
Matplot++: A C++ Graphics Library for Data Visualization 📊🗾
Stars: ✭ 2,433 (+1961.86%)
Tablesaw
Java dataframe and visualization library
Stars: ✭ 2,785 (+2260.17%)
Streamlit
Streamlit: The fastest way to build data apps in Python
Stars: ✭ 16,906 (+14227.12%)
Igel
A delightful machine learning tool that allows you to train, test, and use models without writing code
Stars: ✭ 2,956 (+2405.08%)
Machine Learning Resources
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (+91.53%)
Xlearn
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM), with Python and CLI interfaces.
Stars: ✭ 2,968 (+2415.25%)
Urs
Universal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (+133.05%)
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1033.9%)
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+1468.64%)
Kneed
Knee point detection in Python 📈
Stars: ✭ 328 (+177.97%)
Akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Stars: ✭ 4,334 (+3572.88%)
Pandas Summary
An extension to the pandas DataFrame describe function.
Stars: ✭ 361 (+205.93%)
Sealion
The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Stars: ✭ 278 (+135.59%)
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-9.32%)
Prettypandas
A Pandas Styler class for making beautiful tables
Stars: ✭ 376 (+218.64%)
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (+166.95%)
Courses
Quizzes and assignments from Coursera
Stars: ✭ 454 (+284.75%)
Jupyter pivottablejs
Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (+262.71%)
Awesome R
A curated list of awesome R packages, frameworks and software.
Stars: ✭ 4,858 (+4016.95%)
Batchflow
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Stars: ✭ 156 (+32.2%)
Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+4395.76%)
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (+419.49%)
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+496.61%)
Fklearn
fklearn: Functional Machine Learning
Stars: ✭ 1,305 (+1005.93%)
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+358.47%)
Resources
PyMC3 educational resources
Stars: ✭ 930 (+688.14%)
Model Describer
model-describer: Making machine learning interpretable to humans
Stars: ✭ 22 (-81.36%)
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+623.73%)
Steppy Toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-82.2%)
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (+732.2%)
Rumale
Rumale is a machine learning library in Ruby
Stars: ✭ 526 (+345.76%)
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+811.86%)
Tiledb
The Universal Storage Engine
Stars: ✭ 1,072 (+808.47%)
Datacomparer
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-50.85%)
Datacamp
🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-41.53%)
Mlbox
MLBox is a powerful Automated Machine Learning Python library.
Stars: ✭ 1,199 (+916.1%)
Datasciencevm
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (+29.66%)
Dapy
Easy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (+343.22%)
Mathematicavsr
Example projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-65.25%)
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+1184.75%)
Flyte
Accelerate your ML and Data workflows to production. Flyte is a production-grade orchestration system for your Data and ML workloads. It has been battle-tested at Lyft, Spotify, Freenome, and others, and is truly open source.
Stars: ✭ 1,242 (+952.54%)