Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
DgfraudA Deep Graph-based Toolbox for Fraud Detection
LudwigData-centric declarative deep learning framework
Metaflow🚀 Build and manage real-life data science projects with ease!
MimesisMimesis is a high-performance fake data generator for Python, which provides data for a variety of purposes in a variety of languages.
140stories140Stories: Collaborative stories 140 chars at a time.
Open-Data-Laban initiative to provide infrastructure for reproducible workflows around open data
autonomioCore functionality for the Autonomio augmented intelligence workbench.
pythonPython codes from tutorials on the Data Professor YouTube channel
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
neptune-examplesExamples of using Neptune to keep track of your experiments (maintenance only).
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
genstarGeneration of Synthetic Populations Library
ETL-Starter-Kit📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
data-science-best-practicesThe goal of this repository is to enable data scientists and ML engineers to develop data science use cases and making it ready for production use. This means focusing on the versioning, scalability, monitoring and engineering of the solution.
HackyHourHandbookA handbook for those who want to start coordinating Hacky Hour events in their University/Institute
nl4dvA python toolkit to create Visualizations (Vis) using natural language (NL) or add an NL interface to existing Vis.
genero-nomesClassifica nomes por gênero de acordo com API do IBGE
dstyet another custom data science template via cookiecutter
d20datascienceData science investigations into the mechanics of the world's greatest role playing game
snorkelSnorkel - Bootstrap your Data Science
RcppDynProgDynamic Programming implemented in Rcpp. Includes example partition and out of sample fitting applications.
AgePredictorAge classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum
ScalaTIKZScalaTIKZ is an open-source library for PGF/TIKZ vector graphics.
Machine-learningThis repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
R-data-wranglingMaterials for my my R data workshop. https://cengel.github.io/R-data-wrangling/
data-science-popular-algorithmsData Science algorithms and topics that you must know. (Newly Designed) Recommender Systems, Decision Trees, K-Means, LDA, RFM-Segmentation, XGBoost in Python, R, and Scala.
ML-CaPsuleML-capsule is a Project for beginners and experienced data science Enthusiasts who don't have a mentor or guidance and wish to learn Machine learning. Using our repo they can learn ML, DL, and many related technologies with different real-world projects and become Interview ready.
objectiv-analyticsPowerful product analytics for data teams, with full control over data & models.
gan deeplearning4jAutomatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
awesome-conformal-predictionA professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.
primrosePrimrose modeling framework for simple production models
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
k3aiA lightweight tool to get an AI Infrastructure Stack up in minutes not days. K3ai will take care of setup K8s for You, deploy the AI tool of your choice and even run your code on it.