H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+1830.38%)
Interpretable machine learning with python
Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (+80.89%)
Benchm Ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+526.28%)
Vds
Verteego Data Suite
Stars: ✭ 9 (-96.93%)
Mli Resources
H2O.ai Machine Learning Interpretability Resources
Stars: ✭ 428 (+46.08%)
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-77.82%)
H2o Tutorials
Tutorials and training material for the H2O Machine Learning Platform
Stars: ✭ 1,305 (+345.39%)
Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (-25.94%)
Nimbusml
Python machine learning package providing simple interoperability between ML.NET and scikit-learn components.
Stars: ✭ 265 (-9.56%)
Urs
Universal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-6.14%)
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1266.21%)
R4ds
R for data science: a book
Stars: ✭ 3,231 (+1002.73%)
Apd Core
Core repo for
Stars: ✭ 264 (-9.9%)
Openintro Statistics
📚 An open-source textbook written at the college level. OpenIntro also offers a second college-level intro stat textbook and also a high school variant.
Stars: ✭ 283 (-3.41%)
Data Science Learning
Repository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
Stars: ✭ 273 (-6.83%)
Polyaxon
Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+912.29%)
Python Is Cool
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
Stars: ✭ 2,962 (+910.92%)
Atlas
An Open Source, Self-Hosted Platform For Applied Deep Learning Development
Stars: ✭ 259 (-11.6%)
Code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-2.05%)
Oie Resources
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-3.41%)
Open Quant Live Book
An open source, hands-on and fully reproducible book in quantitative finance, data science and econophysics. Join us and help Make Wall Street Great Again!
Stars: ✭ 275 (-6.14%)
ForestFlow
ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.
Stars: ✭ 55 (-81.23%)
steam
DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.
Stars: ✭ 59 (-79.86%)
Xlearn
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Stars: ✭ 2,968 (+912.97%)
Facet
Human-explainable AI.
Stars: ✭ 269 (-8.19%)
Shogun
Shōgun
Stars: ✭ 2,859 (+875.77%)
Awesome Mlops
😎 A curated list of awesome MLOps tools
Stars: ✭ 258 (-11.95%)
Sagify
MLOps for AWS SageMaker. www.sagifyml.com
Stars: ✭ 277 (-5.46%)
Goro
A High-level Machine Learning Library for Go
Stars: ✭ 265 (-9.56%)
Dirty cat
Encoding methods for dirty categorical variables
Stars: ✭ 259 (-11.6%)
Dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Stars: ✭ 3,480 (+1087.71%)
Uncertainty Baselines
High-quality implementations of standard and SOTA methods on a variety of tasks.
Stars: ✭ 278 (-5.12%)
Sk Dist
Distributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (-11.26%)
Data Science Hacks
Data Science Hacks consists of tips and tricks to help you become a better data scientist. Data science hacks are for all, from beginner to advanced. They cover Python, Jupyter Notebook, pandas hacks and so on.
Stars: ✭ 273 (-6.83%)
Course Nlp
A Code-First Introduction to NLP course
Stars: ✭ 3,029 (+933.79%)
Lantern
Data exploration glue
Stars: ✭ 292 (-0.34%)
sldm4-h2o
Statistical Learning & Data Mining IV - H2O Presentation & Tutorial
Stars: ✭ 26 (-91.13%)
Flux.jl
Relax! Flux is the ML library that doesn't make you tensor
Stars: ✭ 3,358 (+1046.08%)
pyh2o
Python binding for the H2O HTTP server
Stars: ✭ 25 (-91.47%)
Qframe
Immutable data frame for Go
Stars: ✭ 282 (-3.75%)
Chartify
Python library that makes it easy for data scientists to create charts.
Stars: ✭ 3,054 (+942.32%)
mercury-ml
Mercury-ML is an open source Machine Learning workflow management library. Its core contributors are employees of Alexander Thamm GmbH
Stars: ✭ 37 (-87.37%)
skutil
NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-learn and h2o extension classes (as well as caret classes for python). See more here: https://tgsmith61591.github.io/skutil
Stars: ✭ 29 (-90.1%)
exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-92.15%)
Python Articles
Monthly Series - Top 10 Python Articles
Stars: ✭ 288 (-1.71%)
nih-chest-xray
Identifying diseases in chest X-rays using convolutional neural networks
Stars: ✭ 83 (-71.67%)
forecastVeg
A Machine Learning Approach to Forecasting Remotely Sensed Vegetation Health in Python
Stars: ✭ 44 (-84.98%)
Keras
Deep Learning for humans
Stars: ✭ 53,476 (+18151.19%)
Awesome Datascience
📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+5879.52%)
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1463.48%)