AtmAuto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).
Stars: ✭ 504 (-42.73%)
Spark NotebookInteractive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+250.11%)
Entity Recognition DatasetsA collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+1.25%)
Issue Label BotCode For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"
Stars: ✭ 292 (-66.82%)
Querido Diario📰 Brazilian government gazettes, accessible to everyone.
Stars: ✭ 681 (-22.61%)
CodeCompilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-67.39%)
Cluecorpus2020Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Stars: ✭ 278 (-68.41%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-1.82%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+536.36%)
Uncertainty BaselinesHigh-quality implementations of standard and SOTA methods on a variety of tasks.
Stars: ✭ 278 (-68.41%)
QframeImmutable data frame for Go
Stars: ✭ 282 (-67.95%)
OpenmlOpen Machine Learning
Stars: ✭ 489 (-44.43%)
MeglassAn eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (-68.07%)
LuxPython API for Intelligent Visual Data Discovery
Stars: ✭ 787 (-10.57%)
SealionThe first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Stars: ✭ 278 (-68.41%)
RoughvizReusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Stars: ✭ 6,022 (+584.32%)
UrsUniversal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-68.75%)
Machine Learning RoadmapA roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Stars: ✭ 5,277 (+499.66%)
Data Science LearningRepository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
Stars: ✭ 273 (-68.98%)
Docker ImagesOut-of-box Data Science / AI platform | AI/数据科学的瑞士军刀
Stars: ✭ 25 (-97.16%)
Open Quant Live BookAn open source, hands-on and fully reproducible book in quantitative finance, data science and econophysics. Join us and help Make Wall Street Great Again!
Stars: ✭ 275 (-68.75%)
Awesome RoboticsA curated list of awesome links and software libraries that are useful for robots.
Stars: ✭ 478 (-45.68%)
XlearnHigh performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Stars: ✭ 2,968 (+237.27%)
Kaggle Cli(Deprecated, use https://github.com/Kaggle/kaggle-api instead) An unofficial Kaggle command line tool.
Stars: ✭ 675 (-23.3%)
Best Of Ml Python🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Stars: ✭ 6,057 (+588.3%)
GophernotesThe Go kernel for Jupyter notebooks and nteract.
Stars: ✭ 3,100 (+252.27%)
RioA Swiss-Army Knife for Data I/O
Stars: ✭ 467 (-46.93%)
FacetHuman-explainable AI.
Stars: ✭ 269 (-69.43%)
Test TubePython library to easily log experiments and parallelize hyperparameter search for neural networks
Stars: ✭ 663 (-24.66%)
ShogunShōgun
Stars: ✭ 2,859 (+224.89%)
Mlr3mlr3: Machine Learning in R - next generation
Stars: ✭ 463 (-47.39%)
Awesome Mlops😎 A curated list of awesome MLOps tools
Stars: ✭ 258 (-70.68%)
PbaEfficient Learning of Augmentation Policy Schedules
Stars: ✭ 461 (-47.61%)
PrefectThe easiest way to automate your data
Stars: ✭ 7,956 (+804.09%)
AlphapyAutomated Machine Learning [AutoML] with Python, scikit-learn, Keras, XGBoost, LightGBM, and CatBoost
Stars: ✭ 564 (-35.91%)
Dirty catEncoding methods for dirty categorical variables
Stars: ✭ 259 (-70.57%)
DowhyDoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Stars: ✭ 3,480 (+295.45%)
GopGoPlus - The Go+ language for engineering, STEM education, and data science
Stars: ✭ 7,829 (+789.66%)
Sk DistDistributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (-70.45%)
Nlp📝 This repository recorded my NLP journey.
Stars: ✭ 820 (-6.82%)
GenrlA PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
Stars: ✭ 356 (-59.55%)
GirderA data management platform for the web, developed by Kitware
Stars: ✭ 350 (-60.23%)
BestofmlThe best resources around Machine Learning
Stars: ✭ 349 (-60.34%)
Awesome MlopsA curated list of references for MLOps
Stars: ✭ 7,119 (+708.98%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+846.48%)
Pydata.krPyData Korea 공식 홈페이지입니다. (준비중)
Stars: ✭ 13 (-98.52%)