All Projects → Data Science Cookbook → Similar Projects or Alternatives

7118 Open source projects that are alternatives of or similar to Data Science Cookbook

Python for ml
brief introduction to Python for machine learning
Stars: ✭ 29 (-51.67%)
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+1563.33%)
Mutual labels:  jupyter-notebook, data-science, spark
Pymc Example Project
Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
Stars: ✭ 90 (+50%)
Imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Stars: ✭ 194 (+223.33%)
Sk Dist
Distributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (+333.33%)
Mutual labels:  data-science, spark, scikit-learn
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+2426.67%)
Python Hierarchical Clustering Exercises
Exercises for hierarchical clustering with Python 3 and scipy as Jupyter Notebooks
Stars: ✭ 62 (+3.33%)
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+6.67%)
Mutual labels:  jupyter-notebook, data-science, spark
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (+163.33%)
Mutual labels:  jupyter-notebook, data-science, spark
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+263.33%)
Scikit Learn Tips
🤖⚡️ scikit-learn tips
Stars: ✭ 1,203 (+1905%)
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+36646.67%)
Mutual labels:  data-science, spark, scikit-learn
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+1906.67%)
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-66.67%)
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (+133.33%)
Scikit Learn Videos
Jupyter notebooks from the scikit-learn video series
Stars: ✭ 3,254 (+5323.33%)
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+588.33%)
Mutual labels:  jupyter-notebook, data-science, spark
Machinelearningcourse
A collection of notebooks of my Machine Learning class written in python 3
Stars: ✭ 35 (-41.67%)
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+405%)
Mutual labels:  jupyter-notebook, spark, scikit-learn
Python Machine Learning Book
The "Python Machine Learning (1st edition)" book code repository and info resource
Stars: ✭ 11,428 (+18946.67%)
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+3795%)
Eli5
A library for debugging/inspecting machine learning classifiers and explaining their predictions
Stars: ✭ 2,477 (+4028.33%)
Virgilio
Virgilio is developed and maintained by these awesome people. You can email us virgilio.datascience (at) gmail.com or join the Discord chat.
Stars: ✭ 13,200 (+21900%)
Ds and ml projects
Data Science & Machine Learning projects and tutorials in python from beginner to advanced level.
Stars: ✭ 56 (-6.67%)
Thesemicolon
This repository contains Ipython notebooks and datasets for the data analytics youtube tutorials on The Semicolon.
Stars: ✭ 345 (+475%)
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (+278.33%)
Mutual labels:  jupyter-notebook, data-science, spark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+9326.67%)
Mutual labels:  jupyter-notebook, data-science, spark
Sklearn Evaluation
Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.
Stars: ✭ 294 (+390%)
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1543.33%)
Mutual labels:  jupyter-notebook, data-science, spark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (+86.67%)
Mutual labels:  jupyter-notebook, data-science, spark
Machine Learning With Python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+3561.67%)
Dive Into Machine Learning
Dive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+17916.67%)
Data Science Projects With Python
A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn
Stars: ✭ 198 (+230%)
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+2130%)
Mutual labels:  jupyter-notebook, data-science, spark
Code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (+378.33%)
Data Science Portfolio
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (+831.67%)
Python Machine Learning Book 2nd Edition
The "Python Machine Learning (2nd edition)" book code repository and info resource
Stars: ✭ 6,422 (+10603.33%)
Tiledb Vcf
Efficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-56.67%)
Mutual labels:  data-science, spark
Resources
PyMC3 educational resources
Stars: ✭ 930 (+1450%)
Mutual labels:  jupyter-notebook, data-science
Awesome Google Colab
Google Colaboratory Notebooks and Repositories (by @firmai)
Stars: ✭ 863 (+1338.33%)
Mutual labels:  jupyter-notebook, data-science
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+13781.67%)
Mutual labels:  jupyter-notebook, data-science
Kubeflow Data Science On Steroids
The blog post about Kubeflow, including all materials
Stars: ✭ 25 (-58.33%)
Mutual labels:  jupyter-notebook, data-science
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1340%)
Mutual labels:  jupyter-notebook, data-science
Icon2017
Repository for the ICON 2017 hackathon 'multivoxel pattern analysis (MVPA) of fMRI data in Python'
Stars: ✭ 14 (-76.67%)
Mutual labels:  jupyter-notebook, scikit-learn
Intro Python
Python pour Statistique et Science des Données -- Syntaxe, Trafic de Données, Graphes, Programmation, Apprentissage
Stars: ✭ 21 (-65%)
Mutual labels:  jupyter-notebook, data-science
Pythondatasciencehandbook
The book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.
Stars: ✭ 31,995 (+53225%)
Mutual labels:  jupyter-notebook, scikit-learn
Python Introducing Pandas
Introduction to pandas Treehouse course
Stars: ✭ 24 (-60%)
Mutual labels:  jupyter-notebook, data-science
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-76.67%)
Mutual labels:  jupyter-notebook, spark
Deep learning projects
Stars: ✭ 28 (-53.33%)
Mutual labels:  jupyter-notebook, scikit-learn
Mlnet Workshop
ML.NET Workshop to predict car sales prices
Stars: ✭ 29 (-51.67%)
Mutual labels:  jupyter-notebook, data-science
Mljar Supervised
Automated Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning 🚀
Stars: ✭ 961 (+1501.67%)
Mutual labels:  data-science, scikit-learn
Machine Learning Alpine
Alpine Container for Machine Learning
Stars: ✭ 30 (-50%)
Mutual labels:  jupyter-notebook, scikit-learn
Prediciting Binary Options
Predicting forex binary options using time series data and machine learning
Stars: ✭ 33 (-45%)
Mutual labels:  jupyter-notebook, scikit-learn
The Deep Learning With Keras Workshop
An Interactive Approach to Understanding Deep Learning with Keras
Stars: ✭ 34 (-43.33%)
Mutual labels:  jupyter-notebook, scikit-learn
Docker Iocaml Datascience
Dockerfile of Jupyter (IPython notebook) and IOCaml (OCaml kernel) with libraries for data science and machine learning
Stars: ✭ 30 (-50%)
Mutual labels:  jupyter-notebook, data-science
Python Training
Python training for business analysts and traders
Stars: ✭ 972 (+1520%)
Mutual labels:  jupyter-notebook, data-science
Data Science Best Resources
Carefully curated resource links for data science in one place
Stars: ✭ 1,104 (+1740%)
Mutual labels:  data-science, scikit-learn
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (+1536.67%)
Mutual labels:  jupyter-notebook, data-science
Minerva Training Materials
Learn advanced data science on real-life, curated problems
Stars: ✭ 37 (-38.33%)
Mutual labels:  jupyter-notebook, data-science
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-61.67%)
Mutual labels:  jupyter-notebook, spark
1-60 of 7118 similar projects