AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+8381.03%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+194.83%)
FlyteAccelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+2041.38%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+1755.17%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+2018.97%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (+574.14%)
GraphiaA visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (+15.52%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+7372.41%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+8444.83%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-72.41%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+370.69%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+14608.62%)
PrettypandasA Pandas Styler class for making beautiful tables
Stars: ✭ 376 (+548.28%)
Jupyter pivottablejsDrag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (+637.93%)
RioA Swiss-Army Knife for Data I/O
Stars: ✭ 467 (+705.17%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+682.76%)
Machine Learning RoadmapA roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Stars: ✭ 5,277 (+8998.28%)
Awesome RA curated list of awesome R packages, frameworks and software.
Stars: ✭ 4,858 (+8275.86%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+832.76%)
TiledbThe Universal Storage Engine
Stars: ✭ 1,072 (+1748.28%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+922.41%)
Disk.frameFast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data
Stars: ✭ 517 (+791.38%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (+956.9%)
DataprooferA proofreader for your data
Stars: ✭ 628 (+982.76%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+612.07%)
Data ScienceCollection of useful data science topics along with code and articles
Stars: ✭ 315 (+443.1%)
GopGoPlus - The Go+ language for engineering, STEM education, and data science
Stars: ✭ 7,829 (+13398.28%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+8663.79%)
DataexplorerAutomate Data Exploration and Treatment
Stars: ✭ 362 (+524.14%)
RumaleRumale is a machine learning library in Ruby
Stars: ✭ 526 (+806.9%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+9046.55%)
DapyEasy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (+801.72%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+917.24%)
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+9584.48%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+972.41%)
ArticlesA repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (+503.45%)
Awesome StreamlitThe purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (+1225.86%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+1327.59%)
RowsA common, beautiful interface to tabular data, no matter the format
Stars: ✭ 739 (+1174.14%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-62.07%)
SocratA Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-55.17%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+1113.79%)
Riceteacatpandarepo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-68.97%)
ResourcesPyMC3 educational resources
Stars: ✭ 930 (+1503.45%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+1372.41%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+1567.24%)
Mlcourse.aiOpen Machine Learning Course
Stars: ✭ 7,963 (+13629.31%)
Janitorsimple tools for data cleaning in R
Stars: ✭ 981 (+1591.38%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+14260.34%)