7809 open source projects that are alternatives to or similar to Spark Py Notebooks

Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-91.85%)
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-26.31%)
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-47.38%)
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+322.72%)
Data Analysis And Machine Learning Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Stars: ✭ 5,166 (+286.1%)
Sci Pype
A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository.
Stars: ✭ 90 (-93.27%)
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-88.79%)
Mutual labels:  jupyter-notebook, spark, big-data, pyspark
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-59.57%)
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-94.69%)
Mutual labels:  jupyter-notebook, spark, big-data, bigdata
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-91.55%)
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (-28.7%)
Mutual labels:  jupyter-notebook, spark, notebook, pyspark
Datasciencevm
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-88.57%)
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-44.32%)
Mutual labels:  jupyter-notebook, spark, big-data, bigdata
Quantitative Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (-73.39%)
Nteract
📘 The interactive computing suite for you! ✨
Stars: ✭ 5,713 (+326.98%)
Courses
Quizzes and assignments from Coursera
Stars: ✭ 454 (-66.07%)
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (-12.18%)
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-94.1%)
Mutual labels:  data-science, spark, data-analysis, big-data
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+114.05%)
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-92%)
leaflet heatmap
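A simple visualization of Huzhou call data. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap rendering step is moved offline for computation and analysis. After the data is computed in parallel with Apache Spark, the heatmap is also rendered with Apache Spark, and leafletjs then loads the OpenStreetMap layer and the heatmap layer to give good interactivity. Rendering is currently implemented with Apache Spark; perhaps because Apache Spark is not well suited to this kind of computation, or because I did not design the algorithm well, the parallel computation is slower than single-machine computation. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .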
Stars: ✭ 13 (-99.03%)
Mutual labels:  big-data, spark, bigdata, data-analysis
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-95.22%)
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-94.84%)
Mutual labels:  data-analysis, big-data, bigdata
Datacamp
🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-94.84%)
Allstate capstone
Allstate Kaggle Competition ML Capstone Project
Stars: ✭ 72 (-94.62%)
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+1547.83%)
Mutual labels:  data-science, spark, big-data
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-69.13%)
Mutual labels:  jupyter-notebook, data-science, spark
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-68.16%)
Mutual labels:  jupyter-notebook, big-data, bigdata
Data Science Your Way
Ways of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (-60.39%)
Digital Signal Processing Lecture
Digital Signal Processing - Theory and Computational Examples
Stars: ✭ 532 (-60.24%)
Mutual labels:  ipython, jupyter-notebook, notebook
The Elements Of Statistical Learning Python Notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
Stars: ✭ 405 (-69.73%)
Jupyter pivottablejs
Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (-68.01%)
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (-76.46%)
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (-52.69%)
Mutual labels:  data-science, spark, pyspark
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (-10.01%)
Notebooks
A collection of Jupyter/IPython notebooks
Stars: ✭ 78 (-94.17%)
Tutorials
CatBoost tutorials repository
Stars: ✭ 563 (-57.92%)
Sciblog support
Support content for my blog
Stars: ✭ 694 (-48.13%)
Ipython Dashboard
A standalone, lightweight web server for building and sharing graphs created in IPython. Built for data scientists and data analysts. Aims to provide interactive visualization, collaborative dashboards, and real-time streaming graphs.
Stars: ✭ 664 (-50.37%)
Mutual labels:  ipython, data-science, notebook
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (-73.84%)
Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+296.49%)
Mutual labels:  data-science, data-analysis, big-data
Nbstripout
strip output from Jupyter and IPython notebooks
Stars: ✭ 738 (-44.84%)
Show ast
An IPython notebook plugin for visualizing ASTs.
Stars: ✭ 76 (-94.32%)
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-98.28%)
Mutual labels:  jupyter-notebook, spark, pyspark
Lambdaschooldatascience
Completed assignments and coding challenges from the Lambda School Data Science program.
Stars: ✭ 22 (-98.36%)
Pyspark Setup Demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-98.21%)
Mutual labels:  jupyter-notebook, big-data, pyspark
Jupytemplate
Templates for jupyter notebooks
Stars: ✭ 85 (-93.65%)
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-98.65%)
Resources
PyMC3 educational resources
Stars: ✭ 930 (-30.49%)
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-36.17%)
Mutual labels:  data-science, data-analysis, big-data
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+522.5%)
Ansible Jupyterhub
Ansible role to set up a JupyterHub server (deprecated)
Stars: ✭ 14 (-98.95%)
Skdata
Python tools for data analysis
Stars: ✭ 16 (-98.8%)
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-35.43%)
Minerva Training Materials
Learn advanced data science on real-life, curated problems
Stars: ✭ 37 (-97.23%)
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (-25.41%)
Mutual labels:  jupyter-notebook, data-science, spark
Telepyth
Telegram notification with IPython magics.
Stars: ✭ 54 (-95.96%)
Data Science Cookbook
🎓 Jupyter notebooks from UFC data science course
Stars: ✭ 60 (-95.52%)
Mutual labels:  jupyter-notebook, data-science, spark
Starcraft2 Replay Analysis
A jupyter notebook that provides analysis for StarCraft 2 replays
Stars: ✭ 90 (-93.27%)
Data Science Lunch And Learn
Resources for weekly Data Science Lunch & Learns
Stars: ✭ 49 (-96.34%)