All Projects → Scalable Data Science Platform → Similar Projects or Alternatives

7400 Open source projects that are alternatives of or similar to Scalable Data Science Platform

Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+531.65%)
Mutual labels:  jupyter-notebook, data-science, spark
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+524.05%)
Mutual labels:  jupyter-notebook, data-science, spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+161.39%)
Mutual labels:  jupyter-notebook, data-science, spark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-29.11%)
Mutual labels:  jupyter-notebook, data-science, spark
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (+43.67%)
Mutual labels:  jupyter-notebook, data-science, spark
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-59.49%)
Mutual labels:  jupyter-notebook, data-science, spark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+3479.75%)
Mutual labels:  jupyter-notebook, data-science, spark
Data Science Cookbook
🎓 Jupyter notebooks from UFC data science course
Stars: ✭ 60 (-62.03%)
Mutual labels:  jupyter-notebook, data-science, spark
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+746.84%)
Mutual labels:  jupyter-notebook, data-science, spark
Fastbook
The fastai book, published as Jupyter Notebooks
Stars: ✭ 13,998 (+8759.49%)
Mutual labels:  jupyter-notebook, data-science
Pandas Videos
Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+986.08%)
Mutual labels:  jupyter-notebook, data-science
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-22.15%)
Mutual labels:  jupyter-notebook, data-science
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-28.48%)
Mutual labels:  jupyter-notebook, data-science
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-27.85%)
Mutual labels:  jupyter-notebook, data-science
Zigzag
Python library for identifying the peaks and valleys of a time series.
Stars: ✭ 156 (-1.27%)
Mutual labels:  jupyter-notebook, data-science
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+859.49%)
Mutual labels:  jupyter-notebook, data-science
Programming With Data
🐍 Learn Python and Pandas from the ground up
Stars: ✭ 156 (-1.27%)
Mutual labels:  jupyter-notebook, data-science
Seq2seq tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-16.46%)
Mutual labels:  jupyter-notebook, data-science
2016 Ml Contest
Machine learning contest - October 2016 TLE
Stars: ✭ 135 (-14.56%)
Mutual labels:  jupyter-notebook, data-science
Python For Data Science
A blog for data analytics using data science technologies
Stars: ✭ 139 (-12.03%)
Mutual labels:  jupyter-notebook, data-science
Dive Into Machine Learning
Dive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+6741.77%)
Mutual labels:  jupyter-notebook, data-science
Data Science Wg
SF Brigade's Data Science Working Group.
Stars: ✭ 135 (-14.56%)
Mutual labels:  jupyter-notebook, data-science
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-11.39%)
Mutual labels:  jupyter-notebook, data-science
Data science blogs
A repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-12.03%)
Mutual labels:  jupyter-notebook, spark
Jupyter
Stars: ✭ 145 (-8.23%)
Mutual labels:  jupyter-notebook, data-science
Scipy con 2019
Tutorial Sessions for SciPy Con 2019
Stars: ✭ 142 (-10.13%)
Mutual labels:  jupyter-notebook, data-science
Python Machine Learning Book
The "Python Machine Learning (1st edition)" book code repository and info resource
Stars: ✭ 11,428 (+7132.91%)
Mutual labels:  jupyter-notebook, data-science
Loandefault Prediction
Lending Club Loan data analysis
Stars: ✭ 113 (-28.48%)
Mutual labels:  jupyter-notebook, data-science
Kaggle Houseprices
Kaggle Kernel for House Prices competition https://www.kaggle.com/massquantity/all-you-need-is-pca-lb-0-11421-top-4
Stars: ✭ 113 (-28.48%)
Mutual labels:  jupyter-notebook, data-science
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-3.8%)
Mutual labels:  data-science, spark
Algocode
Welcome everyone!🌟 Here you can solve problems, build scrappers and much more💻
Stars: ✭ 113 (-28.48%)
Mutual labels:  jupyter-notebook, data-science
Elassandra
Elassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+918.99%)
Mutual labels:  rest-api, spark
Automunge
Artificial Learning, Intelligent Machines
Stars: ✭ 119 (-24.68%)
Mutual labels:  jupyter-notebook, data-science
Spark Alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-22.78%)
Mutual labels:  data-science, spark
Krisk
Statistical Interactive Visualization with pandas+Jupyter integration on top of Echarts.
Stars: ✭ 111 (-29.75%)
Mutual labels:  jupyter-notebook, data-science
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-18.99%)
Mutual labels:  jupyter-notebook, data-science
Data Science For Marketing Analytics
Achieve your marketing goals with the data analytics power of Python
Stars: ✭ 127 (-19.62%)
Mutual labels:  jupyter-notebook, data-science
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-14.56%)
Mutual labels:  jupyter-notebook, data-science
Cape Python
Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-20.89%)
Mutual labels:  data-science, spark
Machine Learning And Data Science
This is a repository which contains all my work related Machine Learning, AI and Data Science. This includes my graduate projects, machine learning competition codes, algorithm implementations and reading material.
Stars: ✭ 137 (-13.29%)
Mutual labels:  jupyter-notebook, data-science
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-13.29%)
Mutual labels:  jupyter-notebook, data-science
Nlpaug
Data augmentation for NLP
Stars: ✭ 2,761 (+1647.47%)
Mutual labels:  jupyter-notebook, data-science
Cheat Sheets
Developer Cheatsheets
Stars: ✭ 145 (-8.23%)
Mutual labels:  jupyter-notebook, data-science
Data Science Question Answer
A repo for data science related questions and answers
Stars: ✭ 2,000 (+1165.82%)
Mutual labels:  jupyter-notebook, data-science
Textbook
Principles and Techniques of Data Science, the textbook for Data 100 at UC Berkeley
Stars: ✭ 145 (-8.23%)
Mutual labels:  jupyter-notebook, data-science
Machine learning for good
Machine learning fundamentals lesson in interactive notebooks
Stars: ✭ 142 (-10.13%)
Mutual labels:  jupyter-notebook, data-science
Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-6.96%)
Mutual labels:  data-science, spark
Pyspark Learning
Updated repository
Stars: ✭ 147 (-6.96%)
Mutual labels:  jupyter-notebook, spark
Nyc Transport
A Unified Database of NYC transport (subway, taxi/Uber, and citibike) data.
Stars: ✭ 148 (-6.33%)
Mutual labels:  jupyter-notebook, data-science
Machine Learning
🌎 machine learning tutorials (mainly in Python3)
Stars: ✭ 1,924 (+1117.72%)
Mutual labels:  jupyter-notebook, data-science
Raspberryturk
The Raspberry Turk is a robot that can play chess—it's entirely open source, based on Raspberry Pi, and inspired by the 18th century chess playing machine, the Mechanical Turk.
Stars: ✭ 140 (-11.39%)
Mutual labels:  jupyter-notebook, data-science
Fantasy Basketball
Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (-7.59%)
Mutual labels:  jupyter-notebook, data-science
Project kojak
Training a Neural Network to Detect Gestures and Control Smart Home Devices with OpenCV in Python
Stars: ✭ 147 (-6.96%)
Mutual labels:  jupyter-notebook, data-science
Testovoe
Home assignments for data science positions
Stars: ✭ 149 (-5.7%)
Mutual labels:  jupyter-notebook, data-science
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-5.06%)
Mutual labels:  jupyter-notebook, spark
The Python Workshop
A New, Interactive Approach to Learning Python
Stars: ✭ 150 (-5.06%)
Mutual labels:  jupyter-notebook, data-science
Benchm Ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+1061.39%)
Mutual labels:  data-science, spark
Py Quantmod
Powerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
Stars: ✭ 155 (-1.9%)
Mutual labels:  jupyter-notebook, data-science
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+1379.11%)
Mutual labels:  jupyter-notebook, data-science
Machine Learning With Python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+1290.51%)
Mutual labels:  jupyter-notebook, data-science
1-60 of 7400 similar projects