Uncertainty BaselinesHigh-quality implementations of standard and SOTA methods on a variety of tasks.
Stars: ✭ 278 (-20.57%)
QframeImmutable data frame for Go
Stars: ✭ 282 (-19.43%)
Apricotapricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models quickly. See the documentation page: https://apricot-select.readthedocs.io/en/latest/index.html
Stars: ✭ 306 (-12.57%)
Arcgis Osm EditorArcGIS Editor for OpenStreetMap is a toolset for GIS users to access and contribute to OpenStreetMap through their Desktop or Server environment.
Stars: ✭ 281 (-19.71%)
SealionThe first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Stars: ✭ 278 (-20.57%)
Xam🎯 Personal data science and machine learning toolbox
Stars: ✭ 306 (-12.57%)
ArtificioDeep Learning Computer Vision Algorithms for Real-World Use
Stars: ✭ 326 (-6.86%)
UrsUniversal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-21.43%)
CartolaExtração de dados da API do CartolaFC, análise exploratória dos dados e modelos preditivos em R e Python - 2014-20. [EN] Data munging, analysis and modeling of CartolaFC - the most popular fantasy football game in Brazil and maybe in the world. Data cover years 2014-19.
Stars: ✭ 304 (-13.14%)
Data Science LearningRepository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
Stars: ✭ 273 (-22%)
ThesemicolonThis repository contains Ipython notebooks and datasets for the data analytics youtube tutorials on The Semicolon.
Stars: ✭ 345 (-1.43%)
Open Quant Live BookAn open source, hands-on and fully reproducible book in quantitative finance, data science and econophysics. Join us and help Make Wall Street Great Again!
Stars: ✭ 275 (-21.43%)
DatasetsA repository of pretty cool datasets that I collected for network science and machine learning research.
Stars: ✭ 302 (-13.71%)
XlearnHigh performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Stars: ✭ 2,968 (+748%)
IrodsOpen Source Data Management Software
Stars: ✭ 321 (-8.29%)
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-13.71%)
Scikit Learn VideosJupyter notebooks from the scikit-learn video series
Stars: ✭ 3,254 (+829.71%)
R4dsR for data science: a book
Stars: ✭ 3,231 (+823.14%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+1138.29%)
NimbusmlPython machine learning package providing simple interoperability between ML.NET and scikit-learn components.
Stars: ✭ 265 (-24.29%)
Apd CoreCore repo for
Stars: ✭ 264 (-24.57%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (-0.57%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1043.71%)
PycaretAn open-source, low-code machine learning library in Python
Stars: ✭ 4,594 (+1212.57%)
PolyaxonMachine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+747.43%)
EvidentlyInteractive reports to analyze machine learning models during validation or production monitoring.
Stars: ✭ 304 (-13.14%)
Python Is CoolCool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
Stars: ✭ 2,962 (+746.29%)
PreqlAn interpreted relational query language that compiles to SQL.
Stars: ✭ 257 (-26.57%)
AtlasAn Open Source, Self-Hosted Platform For Applied Deep Learning Development
Stars: ✭ 259 (-26%)
Dash Docs📖 The Official Dash Userguide & Documentation
Stars: ✭ 338 (-3.43%)
Data visualizationA collection of my data visualizations, mostly in Python.
Stars: ✭ 294 (-16%)
LinkedDataHubThe Knowledge Graph notebook. Apache license.
Stars: ✭ 150 (-57.14%)
Carefree LearnA minimal Automatic Machine Learning (AutoML) solution for tabular datasets based on PyTorch
Stars: ✭ 316 (-9.71%)
telleryTellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (-37.43%)
DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+1071.14%)
VBA-CSV-interfaceThe most powerful and comprehensive CSV/TSV/DSV data management library for VBA, providing parsing/writing capabilities compliant with RFC-4180 specifications and a complete set of tools for manipulating records and fields.
Stars: ✭ 24 (-93.14%)
DeltapyDeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (-1.71%)
Awesome H2oA curated list of research, applications and projects built using the H2O Machine Learning platform
Stars: ✭ 293 (-16.29%)
hpdbscanHighly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (-94.57%)
Scikit RebateA scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
Stars: ✭ 314 (-10.29%)
LanternData exploration glue
Stars: ✭ 292 (-16.57%)
conduitSimplified Data Exchange for HPC Simulations
Stars: ✭ 114 (-67.43%)
lightdashAn open source alternative to Looker built using dbt. Made for analysts ❤️
Stars: ✭ 1,082 (+209.14%)
MlxtendA library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+965.43%)
Pm4py CorePublic repository for the PM4Py (Process Mining for Python) project.
Stars: ✭ 313 (-10.57%)
Issue Label BotCode For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"
Stars: ✭ 292 (-16.57%)
atrocoreAtroCore is an open-source Data Platform, Data Management and Master Data Management (MDM) software, which can be used to quickly create any business application.
Stars: ✭ 38 (-89.14%)
Dash CytoscapeInteractive network visualization in Python and Dash, powered by Cytoscape.js
Stars: ✭ 309 (-11.71%)
CodeCompilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-18%)