Data Science Interview ResourcesA repository listing out the potential sources which will help you in preparing for a Data Science/Machine Learning interview. New resources added frequently.
FeatexpFeature exploration for supervised learning
Querido Diario📰 Brazilian government gazettes, accessible to everyone.
RoughvizReusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Kaggle Cli(Deprecated, use https://github.com/Kaggle/kaggle-api instead) An unofficial Kaggle command line tool.
Ipython DashboardA stand alone, light-weight web server for building, sharing graphs created in ipython. Build for data science, data analysis guys. Aiming at building an interactive visualization, collaborated dashboard, and real-time streaming graph.
Test TubePython library to easily log experiments and parallelize hyperparameter search for neural networks
Hyperparameter hunterEasy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
TsfreshAutomatic extraction of relevant features from time series:
FeaturetoolsAn open source python library for automated feature engineering
Nteract📘 The interactive computing suite for you! ✨
DataprepDataPrep — The easiest way to prepare data in Python
Zero To Mastery MlAll course materials for the Zero to Mastery Machine Learning and Data Science course.
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Fastai2Temporary home for fastai v2 while it's being developed
Boltons🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
LazydataLazydata: Scalable data dependencies for Python projects
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Matrixprofile TsA Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
ElkiELKI Data Mining Toolkit
Dist KerasDistributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Sigma coding youtubeThis is a collection of all the code that can be found on my YouTube channel Sigma Coding.
MoviegeekA django website used in the book Practical Recommender Systems to illustrate how recommender algorithms can be implemented.
SmileStatistical Machine Intelligence & Learning Engine
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
PdpipeEasy pipelines for pandas DataFrames.
Awesome Ai UsecasesA list of awesome and proven Artificial Intelligence use cases and applications
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Vehicle counting tensorflow🚘 "MORE THAN VEHICLE COUNTING!" This project provides prediction for speed, color and size of the vehicles with TensorFlow Object Counting API.
Data Science CompetitionsGoal of this repo is to provide the solutions of all Data Science Competitions(Kaggle, Data Hack, Machine Hack, Driven Data etc...).
BaikalA graph-based functional API for building complex scikit-learn pipelines.
Pygam[HELP REQUESTED] Generalized Additive Models in Python
AlphapyAutomated Machine Learning [AutoML] with Python, scikit-learn, Keras, XGBoost, LightGBM, and CatBoost
Data Science PortfolioPortfolio of data science projects completed by me for academic, self learning, and hobby purposes.
NipypeWorkflows and interfaces for neuroimaging packages
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Intro To PythonAn intro to Python & programming for wanna-be data scientists
Feature SelectionFeatures selector based on the self selected-algorithm, loss function and validation method