exemplary-ml-pipelineExemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-91.12%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-15.83%)
Voice GenderGender recognition by voice and speech analysis
Stars: ✭ 248 (-4.25%)
CardioCardIO is a library for data science research of heart signals
Stars: ✭ 218 (-15.83%)
nepali-translatorNeural Machine Translation on the Nepali-English language pair
Stars: ✭ 29 (-88.8%)
TutorialsAI-related tutorials. Access any of them for free → https://towardsai.net/editorial
Stars: ✭ 204 (-21.24%)
CjworkbenchThe data journalism platform with built in training
Stars: ✭ 244 (-5.79%)
Sc17SuperComputing 2017 Deep Learning Tutorial
Stars: ✭ 211 (-18.53%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (-89.19%)
Covid19zaCoronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa
Stars: ✭ 208 (-19.69%)
RetrieverQuickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (-6.95%)
Eli5A library for debugging/inspecting machine learning classifiers and explaining their predictions
Stars: ✭ 2,477 (+856.37%)
Sk DistDistributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (+0.39%)
ScihubSource code and data analyses for the Sci-Hub Coverage Study
Stars: ✭ 205 (-20.85%)
Opends4allOpenDS4All project, hosted by LF AI & Data
Stars: ✭ 240 (-7.34%)
Python For Data ScienceA collection of Jupyter Notebooks for learning Python for Data Science.
Stars: ✭ 205 (-20.85%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+421.62%)
Estadistica Con RApuntes personales sobre estadística, machine learning y lenguaje de programación R
Stars: ✭ 201 (-22.39%)
Ntm One Shot TfOne Shot Learning using Memory-Augmented Neural Networks (MANN) based on Neural Turing Machine architecture in Tensorflow
Stars: ✭ 238 (-8.11%)
InstascrapePowerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (-22.01%)
foofahFoofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (-90.73%)
FastpagesAn easy to use blogging platform, with enhanced support for Jupyter Notebooks.
Stars: ✭ 2,888 (+1015.06%)
AchooAchoo uses a Raspberry Pi to predict if my son will need his inhaler on any given day using weather, pollen, and air quality data. If the prediction for a given day is above a specified threshold, the Pi will email his school nurse, and myself, notifying her that he may need preemptive treatment. Community-sourced health monitoring!
Stars: ✭ 200 (-22.78%)
KerasDeep Learning for humans
Stars: ✭ 53,476 (+20547.1%)
PloomberA convention over configuration workflow orchestrator. Develop locally (Jupyter or your favorite editor), deploy to Airflow or Kubernetes.
Stars: ✭ 221 (-14.67%)
DowhyDoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Stars: ✭ 3,480 (+1243.63%)
CqlCategorical Query Language IDE
Stars: ✭ 196 (-24.32%)
Data Science FreeFree Resources For Data Science created by Shubham Kumar
Stars: ✭ 232 (-10.42%)
TadA desktop application for viewing and analyzing tabular data
Stars: ✭ 2,275 (+778.38%)
RayAn open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Stars: ✭ 18,547 (+7061%)
ImodelsInterpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Stars: ✭ 194 (-25.1%)
Prodigy Recipes🍳 Recipes for the Prodigy, our fully scriptable annotation tool
Stars: ✭ 229 (-11.58%)
GophernetA simple from-scratch neural net written in Go
Stars: ✭ 194 (-25.1%)
bumblebee🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (-53.67%)
MachinelearningnotebooksPython notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
Stars: ✭ 2,790 (+977.22%)
TablesawJava dataframe and visualization library
Stars: ✭ 2,785 (+975.29%)
PlynxPLynx is a domain agnostic platform for managing reproducible experiments and data-oriented workflows.
Stars: ✭ 192 (-25.87%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+1075.29%)
SpeedmlSpeedml is a Python package to speed start machine learning projects.
Stars: ✭ 192 (-25.87%)
Uci Ml ApiSimple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Stars: ✭ 190 (-26.64%)
Course NlpA Code-First Introduction to NLP course
Stars: ✭ 3,029 (+1069.5%)
ElasticR client for the Elasticsearch HTTP API
Stars: ✭ 227 (-12.36%)
VirgilioVirgilio is developed and maintained by these awesome people.
You can email us virgilio.datascience (at) gmail.com or join the Discord chat.
Stars: ✭ 13,200 (+4996.53%)
AlphatoolsQuantitative finance research tools in Python
Stars: ✭ 226 (-12.74%)
DelbotIt understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.
Stars: ✭ 191 (-26.25%)
ObservationsTools for loading standard data sets in machine learning
Stars: ✭ 190 (-26.64%)
StreamlitStreamlit — The fastest way to build data apps in Python
Stars: ✭ 16,906 (+6427.41%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-27.41%)
errorlocateFind and replace erroneous fields in data using validation rules
Stars: ✭ 19 (-92.66%)
Pytorch LightningThe lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Stars: ✭ 16,641 (+6325.1%)
DarwinexlabsDatasets, tools and more from Darwinex Labs - Prop Investing Arm & Quant Team @ Darwinex
Stars: ✭ 248 (-4.25%)
Machine Learning ResourcesA curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (-12.74%)
Gspread PandasA package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-12.74%)