Rumale: Rumale is a machine learning library in Ruby
Stars: ✭ 526 (-59.69%)
Datasciencevm: Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-88.28%)
Pycm: Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (-17.55%)
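Gopup: Data interfaces: Baidu, Google, Toutiao, and Weibo indices, macroeconomic data, interest rate data, currency exchange rates, Qianlima and unicorn companies, Xinwen Lianbo news transcripts, film and TV box office data, university lists, epidemic data…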
Stars: ✭ 1,229 (-5.82%)
Cookbook 2nd Code: Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-58.54%)
Cookbook 2nd: IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-46.05%)
Hyperparameter hunter: Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (-50.34%)
Model Describer: Making machine learning interpretable to humans
Stars: ✭ 22 (-98.31%)
Mldm: Stream-wide lecture course "Machine Learning and Data Mining" at the Faculty of Computational Mathematics and Cybernetics (CMC) of Lomonosov Moscow State University
Stars: ✭ 35 (-97.32%)
Awesome Mlops: A curated list of references for MLOps
Stars: ✭ 7,119 (+445.52%)
Pyod: A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+289.5%)
Dapy: Easy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (-59.92%)
Gop: GoPlus - The Go+ language for engineering, STEM education, and data science
Stars: ✭ 7,829 (+499.92%)
Nfstream: NFStream, a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-52.34%)
Dataproofer: A proofreader for your data
Stars: ✭ 628 (-51.88%)
Skdata: Python tools for data analysis
Stars: ✭ 16 (-98.77%)
Pachyderm: Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+306.51%)
Mlcourse.ai: Open Machine Learning Course
Stars: ✭ 7,963 (+510.19%)
Machinelearningcourse: A collection of notebooks from my Machine Learning class, written in Python 3
Stars: ✭ 35 (-97.32%)
Mathematicavsr: Example projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-96.86%)
Tiledb: The Universal Storage Engine
Stars: ✭ 1,072 (-17.85%)
Dltk: Deep Learning Toolkit for Medical Image Analysis
Stars: ✭ 1,249 (-4.29%)
Courses: Quizzes and assignments from Coursera
Stars: ✭ 454 (-65.21%)
Metaflow: 🚀 Build and manage real-life data science projects with ease!
Stars: ✭ 5,108 (+291.42%)
Dex: The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (-5.13%)
Jupyter pivottablejs: Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (-67.2%)
Knowledge Repo: A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+279.77%)
Awesome R: A curated list of awesome R packages, frameworks and software.
Stars: ✭ 4,858 (+272.26%)
Machinejs: [UNMAINTAINED] Automated machine learning: just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
Stars: ✭ 412 (-68.43%)
Flyte: Accelerate your ML and Data workflows to production. Flyte is a production-grade orchestration system for your Data and ML workloads. It has been battle-tested at Lyft, Spotify, Freenome, and others, and is truly open source.
Stars: ✭ 1,242 (-4.83%)
Elki: ELKI Data Mining Toolkit
Stars: ✭ 613 (-53.03%)
Imbalanced Learn: A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+330.42%)
Setl: A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-93.95%)
Dataframe: C++ DataFrame for statistical, financial, and ML analysis in modern C++, using native types and continuous memory storage, with no pointers involved
Stars: ✭ 828 (-36.55%)
Pandas Profiling: Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+538.24%)
Data Science On Gcp: Source code accompanying the book Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-33.79%)
Hyperlearn: 50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (-7.74%)
Dataflowjavasdk: Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-34.56%)
Optimus: 🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-24.44%)
Janitor: Simple tools for data cleaning in R
Stars: ✭ 981 (-24.83%)
Socrat: A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-98.01%)
Graphia: A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-94.87%)
Datacomparer: dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-95.56%)
Datacamp: 🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-94.71%)
Openrefine: OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+553.72%)
Prettypandas: A Pandas Styler class for making beautiful tables
Stars: ✭ 376 (-71.19%)
Datacleaner: The premier open source Data Quality solution
Stars: ✭ 391 (-70.04%)
Resources: PyMC3 educational resources
Stars: ✭ 930 (-28.74%)
Dream3d: Data analysis program and framework for materials science data analytics, based on the SIMPL managing framework.
Stars: ✭ 73 (-94.41%)