H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (-14.16%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (-91.05%)
FeaturetoolsAn open source python library for automated feature engineering
Stars: ✭ 5,891 (-10.59%)
LazydataLazydata: Scalable data dependencies for Python projects
Stars: ✭ 627 (-90.48%)
Book sampleanother book on data science
Stars: ✭ 611 (-90.73%)
Vehicle counting tensorflow🚘 "MORE THAN VEHICLE COUNTING!" This project provides prediction for speed, color and size of the vehicles with TensorFlow Object Counting API.
Stars: ✭ 582 (-91.17%)
Zero To Mastery MlAll course materials for the Zero to Mastery Machine Learning and Data Science course.
Stars: ✭ 631 (-90.42%)
Boltons🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Stars: ✭ 5,671 (-13.93%)
Data Science PortfolioPortfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (-91.52%)
Ipython DashboardA stand alone, light-weight web server for building, sharing graphs created in ipython. Build for data science, data analysis guys. Aiming at building an interactive visualization, collaborated dashboard, and real-time streaming graph.
Stars: ✭ 664 (-89.92%)
Matrixprofile TsA Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
Stars: ✭ 621 (-90.58%)
FeatexpFeature exploration for supervised learning
Stars: ✭ 688 (-89.56%)
Dist KerasDistributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (-90.7%)
Hyperparameter hunterEasy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (-90.17%)
SmileStatistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (-17.86%)
OrchestA new kind of IDE for Data Science.
Stars: ✭ 694 (-89.47%)
Awesome Ai UsecasesA list of awesome and proven Artificial Intelligence use cases and applications
Stars: ✭ 587 (-91.09%)
BaikalA graph-based functional API for building complex scikit-learn pipelines.
Stars: ✭ 573 (-91.3%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (-90.39%)
Fastai2Temporary home for fastai v2 while it's being developed
Stars: ✭ 630 (-90.44%)
NipypeWorkflows and interfaces for neuroimaging packages
Stars: ✭ 557 (-91.55%)
Kaggle Cli(Deprecated, use https://github.com/Kaggle/kaggle-api instead) An unofficial Kaggle command line tool.
Stars: ✭ 675 (-89.76%)
DataprooferA proofreader for your data
Stars: ✭ 628 (-90.47%)
Data Science Interview ResourcesA repository listing out the potential sources which will help you in preparing for a Data Science/Machine Learning interview. New resources added frequently.
Stars: ✭ 690 (-89.53%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-90.56%)
Test TubePython library to easily log experiments and parallelize hyperparameter search for neural networks
Stars: ✭ 663 (-89.94%)
EngsoccerdataEnglish and European soccer results 1871-2020
Stars: ✭ 615 (-90.67%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-89.32%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (-90.7%)
Sigma coding youtubeThis is a collection of all the code that can be found on my YouTube channel Sigma Coding.
Stars: ✭ 611 (-90.73%)
Querido Diario📰 Brazilian government gazettes, accessible to everyone.
Stars: ✭ 681 (-89.66%)
MoviegeekA django website used in the book Practical Recommender Systems to illustrate how recommender algorithms can be implemented.
Stars: ✭ 608 (-90.77%)
TsfreshAutomatic extraction of relevant features from time series:
Stars: ✭ 6,077 (-7.77%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-91%)
Nteract📘 The interactive computing suite for you! ✨
Stars: ✭ 5,713 (-13.29%)
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (-14.75%)
Data Science CompetitionsGoal of this repo is to provide the solutions of all Data Science Competitions(Kaggle, Data Hack, Machine Hack, Driven Data etc...).
Stars: ✭ 572 (-91.32%)
DataprepDataPrep — The easiest way to prepare data in Python
Stars: ✭ 639 (-90.3%)
Pygam[HELP REQUESTED] Generalized Additive Models in Python
Stars: ✭ 569 (-91.36%)
H1stThe AI Application Platform We All Need. Human AND Machine Intelligence. Based on experience building AI solutions at Panasonic: robotics predictive maintenance, cold-chain energy optimization, Gigafactory battery mfg, avionics, automotive cybersecurity, and more.
Stars: ✭ 697 (-89.42%)
AlphapyAutomated Machine Learning [AutoML] with Python, scikit-learn, Keras, XGBoost, LightGBM, and CatBoost
Stars: ✭ 564 (-91.44%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-90.39%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (-19.49%)
RoughvizReusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Stars: ✭ 6,022 (-8.61%)
Industry Machine LearningA curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Stars: ✭ 6,077 (-7.77%)
ReflowA language and runtime for distributed, incremental data processing in the cloud
Stars: ✭ 706 (-89.29%)
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (-90.44%)