Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (+668.29%)
Tsrepr
TSrepr: R package for time series representations
Stars: ✭ 75 (+82.93%)
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (+1395.12%)
Collapse
Advanced and Fast Data Transformation in R
Stars: ✭ 184 (+348.78%)
Scikit Mobility
scikit-mobility: mobility analysis in Python
Stars: ✭ 339 (+726.83%)
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+2304.88%)
Quantitative Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (+768.29%)
Datacleaner
The premier open source Data Quality solution
Stars: ✭ 391 (+853.66%)
Pyod
A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+12297.56%)
Awesome R
A curated list of awesome R packages, frameworks and software.
Stars: ✭ 4,858 (+11748.78%)
Pycaret
An open-source, low-code machine learning library in Python
Stars: ✭ 4,594 (+11104.88%)
Kneed
Knee point detection in Python 📈
Stars: ✭ 328 (+700%)
Pydataroad
Open-source code for the WeChat official account (ID: PyDataLab)
Stars: ✭ 302 (+636.59%)
Prettypandas
A Pandas Styler class for making beautiful tables
Stars: ✭ 376 (+817.07%)
Sktime
A unified framework for machine learning with time series
Stars: ✭ 4,741 (+11463.41%)
Courses
Quizzes and assignments from Coursera
Stars: ✭ 454 (+1007.32%)
Pandas Summary
An extension to the pandas DataFrame describe function.
Stars: ✭ 361 (+780.49%)
Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+12839.02%)
Janitor
Simple tools for data cleaning in R
Stars: ✭ 981 (+2292.68%)
Dataproofer
A proofreader for your data
Stars: ✭ 628 (+1431.71%)
H1st
The AI Application Platform We All Need. Human AND Machine Intelligence. Based on experience building AI solutions at Panasonic: robotics predictive maintenance, cold-chain energy optimization, Gigafactory battery manufacturing, avionics, automotive cybersecurity, and more.
Stars: ✭ 697 (+1600%)
Tsfresh
Automatic extraction of relevant features from time series
Stars: ✭ 6,077 (+14721.95%)
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+1617.07%)
Sealion
The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Stars: ✭ 278 (+578.05%)
Mlcourse.ai
Open Machine Learning Course
Stars: ✭ 7,963 (+19321.95%)
Urs
Universal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (+570.73%)
Akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Stars: ✭ 4,334 (+10470.73%)
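Ai Learn
An artificial intelligence learning roadmap that collects nearly 200 hands-on cases and projects, with free accompanying course materials, for beginners starting from zero and for job-oriented practice. Covers popular areas including Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch, and TensorFlow (machine-learning, deep-learning, data-analysis, data-mining, mathematics, data-science, artificial-intelligence, python, tensorflow, tensorflow2, caffe, keras, pytorch, algorithm, numpy, pandas, matplotlib, seaborn, nlp, cv).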
Stars: ✭ 4,387 (+10600%)
Deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+739.02%)
Data Science Hacks
Data Science Hacks consists of tips and tricks to help you become a better data scientist. The hacks are for everyone, from beginner to advanced, and cover Python, Jupyter Notebook, pandas, and so on.
Stars: ✭ 273 (+565.85%)
Dataexplorer
Automate Data Exploration and Treatment
Stars: ✭ 362 (+782.93%)
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (+753.66%)
Seglearn
Python module for machine learning time series
Stars: ✭ 435 (+960.98%)
Jupyter pivottablejs
Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (+943.9%)
Gop
GoPlus - The Go+ language for engineering, STEM education, and data science
Stars: ✭ 7,829 (+18995.12%)
Xlearn
High-performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM), with Python and CLI interfaces.
Stars: ✭ 2,968 (+7139.02%)
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+1219.51%)
Imbalanced Learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+13600%)
Rumale
Rumale is a machine learning library in Ruby
Stars: ✭ 526 (+1182.93%)
Nfstream
NFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+1417.07%)
Matrixprofile Ts
A Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
Stars: ✭ 621 (+1414.63%)
Dapy
Easy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (+1175.61%)
Model Describer
model-describer: Making machine learning interpretable to humans
Stars: ✭ 22 (-46.34%)
Socrat
A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-36.59%)
Skdata
Python tools for data analysis
Stars: ✭ 16 (-60.98%)
Resources
PyMC3 educational resources
Stars: ✭ 930 (+2168.29%)
Dataframe
C++ DataFrame for statistical, financial, and ML analysis in modern C++, using native types and continuous memory storage, with no pointers involved
Stars: ✭ 828 (+1919.51%)
Data Science On Gcp
Source code accompanying the book Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+2007.32%)
tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc.), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (+417.07%)
Knowledge Repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+11987.8%)