Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+774.51%)
NlpaugData augmentation for NLP
Stars: ✭ 2,761 (+1704.58%)
ImodelsInterpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Stars: ✭ 194 (+26.8%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-28.76%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+196.73%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+603.27%)
Pragmaticai[Book-2019] Pragmatic AI: An Introduction to Cloud-based Machine Learning
Stars: ✭ 79 (-48.37%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-26.14%)
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+1794.77%)
RumaleRumale is a machine learning library in Ruby
Stars: ✭ 526 (+243.79%)
Tensor HouseA collection of reference machine learning and optimization models for enterprise operations: marketing, pricing, supply chain
Stars: ✭ 449 (+193.46%)
Awesome MlopsA curated list of references for MLOps
Stars: ✭ 7,119 (+4552.94%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+253.59%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-16.34%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+3596.73%)
Datasist A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-19.61%)
ResourcesPyMC3 educational resources
Stars: ✭ 930 (+507.84%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+458.17%)
Image classifierCNN image classifier implemented in Keras Notebook 🖼️.
Stars: ✭ 139 (-9.15%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+464.71%)
MachinelearningcourseA collection of notebooks of my Machine Learning class written in python 3
Stars: ✭ 35 (-77.12%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+544.44%)
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+686.93%)
Metaflow🚀 Build and manage real-life data science projects with ease!
Stars: ✭ 5,108 (+3238.56%)
Jupyter pivottablejsDrag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (+179.74%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-48.37%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+3367.32%)
Hyperparameter hunterEasy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (+323.53%)
Data ScienceCollection of useful data science topics along with code and articles
Stars: ✭ 315 (+105.88%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-89.54%)
Awesome Ai Ml DlAwesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Stars: ✭ 831 (+443.14%)
AutodlAutomated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+458.17%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+360.13%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+5343.79%)
ArticlesA repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (+128.76%)
RecommendersBest Practices on Recommendation Systems
Stars: ✭ 11,818 (+7624.18%)
Datacamp🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-54.9%)
Fklearnfklearn: Functional Machine Learning
Stars: ✭ 1,305 (+752.94%)
DopamineDopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Stars: ✭ 9,681 (+6227.45%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+800.65%)
Azureml ExamplesOfficial community-driven Azure Machine Learning examples, tested with GitHub Actions
Stars: ✭ 101 (-33.99%)
Seaborn TutorialThis repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-25.49%)
Ml Workspace🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+1427.45%)
Pandas VideosJupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+1021.57%)
Csinva.github.ioSlides, paper notes, class notes, blog posts, and research on ML 📉, statistics 📊, and AI 🤖.
Stars: ✭ 342 (+123.53%)
Quantitative NotebooksEducational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (+132.68%)
Computervision RecipesBest Practices, code samples, and documentation for Computer Vision.
Stars: ✭ 8,214 (+5268.63%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-30.07%)