7809 open source projects that are alternatives to or similar to Spark Py Notebooks

Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-69.13%)
Mutual labels:  jupyter-notebook, data-science, spark
Data Science Notebook
📖 Every great idea and action has a humble beginning
Stars: ✭ 196 (-85.35%)
Mutual labels:  data-science, data-analysis, notebook
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-83.86%)
Mutual labels:  spark, big-data, pyspark
Koalas
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+127.5%)
Mutual labels:  data-science, spark, big-data
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-83.93%)
Mutual labels:  spark, big-data, bigdata
Bayesian Cognitive Modeling In Pymc3
PyMC3 code for Lee and Wagenmakers' Bayesian Cognitive Modeling - A Practical Course
Stars: ✭ 93 (-93.05%)
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain the Data Visualization skills they need to progress in their careers. It includes all the plot types offered by Seaborn, applied to random datasets.
Stars: ✭ 114 (-91.48%)
Pandas Videos
Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+28.25%)
Show ast
An IPython notebook plugin for visualizing ASTs.
Stars: ✭ 76 (-94.32%)
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-89.76%)
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-90.43%)
Data Science Portfolio
A Portfolio of my Data Science Projects
Stars: ✭ 149 (-88.86%)
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+116.67%)
Mutual labels:  spark, pyspark, big-data
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (-88.19%)
Mutual labels:  jupyter-notebook, data-science, spark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-88.19%)
Mutual labels:  jupyter-notebook, spark, pyspark
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-87.22%)
Data Science Your Way
Ways of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (-60.39%)
Digital Signal Processing Lecture
Digital Signal Processing - Theory and Computational Examples
Stars: ✭ 532 (-60.24%)
Mutual labels:  ipython, jupyter-notebook, notebook
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-83.71%)
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-85.05%)
Mutual labels:  jupyter-notebook, spark, pyspark
Ipyexperiments
jupyter/ipython experiment containers for GPU and general RAM re-use
Stars: ✭ 128 (-90.43%)
Mutual labels:  ipython, jupyter-notebook, notebook
Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+296.49%)
Mutual labels:  data-science, data-analysis, big-data
Ipywebrtc
WebRTC for Jupyter notebook/lab
Stars: ✭ 171 (-87.22%)
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-95.29%)
Mutual labels:  jupyter-notebook, spark, pyspark
Signals And Systems Lecture
Continuous- and Discrete-Time Signals and Systems - Theory and Computational Examples
Stars: ✭ 166 (-87.59%)
Mutual labels:  ipython, jupyter-notebook, notebook
Notebooks Statistics And Machinelearning
Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog
Stars: ✭ 270 (-79.82%)
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-91.7%)
Mutual labels:  big-data, spark, pyspark
Datascience course
Data Science course in Portuguese
Stars: ✭ 294 (-78.03%)
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-68.16%)
Mutual labels:  jupyter-notebook, big-data, bigdata
Tutorials
CatBoost tutorials repository
Stars: ✭ 563 (-57.92%)
Notebooks
A collection of Jupyter/IPython notebooks
Stars: ✭ 78 (-94.17%)
Datacamp
🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-94.84%)
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-77.35%)
Mutual labels:  jupyter-notebook, spark, data-analysis
Nbstripout
strip output from Jupyter and IPython notebooks
Stars: ✭ 738 (-44.84%)
Vscodejupyter
Jupyter for Visual Studio Code
Stars: ✭ 337 (-74.81%)
The Elements Of Statistical Learning Python Notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
Stars: ✭ 405 (-69.73%)
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+130.27%)
Mutual labels:  data-science, spark, notebook
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (-10.01%)
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+1547.83%)
Mutual labels:  data-science, spark, big-data
Jupyter pivottablejs
Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (-68.01%)
Jupyterlab Lsp
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
Stars: ✭ 796 (-40.51%)
Mutual labels:  ipython, jupyter-notebook, notebook
Jupytemplate
Templates for jupyter notebooks
Stars: ✭ 85 (-93.65%)
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-98.28%)
Mutual labels:  jupyter-notebook, spark, pyspark
Skdata
Python tools for data analysis
Stars: ✭ 16 (-98.8%)
Pyspark Setup Demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-98.21%)
Mutual labels:  jupyter-notebook, big-data, pyspark
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-36.17%)
Mutual labels:  data-science, data-analysis, big-data
Sciblog support
Support content for my blog
Stars: ✭ 694 (-48.13%)
Ipython Dashboard
A standalone, lightweight web server for building and sharing graphs created in IPython. Built for people working in data science and data analysis. Aims to provide interactive visualization, collaborative dashboards, and real-time streaming graphs.
Stars: ✭ 664 (-50.37%)
Mutual labels:  ipython, data-science, notebook
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+522.5%)
Ansible Jupyterhub
Ansible role to set up a JupyterHub server (deprecated)
Stars: ✭ 14 (-98.95%)
Lambdaschooldatascience
Completed assignments and coding challenges from the Lambda School Data Science program.
Stars: ✭ 22 (-98.36%)
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-98.65%)
Resources
PyMC3 educational resources
Stars: ✭ 930 (-30.49%)
Cryptocurrency Analysis Python
Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data
Stars: ✭ 278 (-79.22%)
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (-52.69%)
Mutual labels:  data-science, spark, pyspark
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-35.43%)
Bitcoin Value Predictor
[NOT MAINTAINED] Predicting Bitcoin price using time series analysis and sentiment analysis of tweets about Bitcoin
Stars: ✭ 91 (-93.2%)
Mutual labels:  jupyter-notebook, big-data, pyspark
Starcraft2 Replay Analysis
A jupyter notebook that provides analysis for StarCraft 2 replays
Stars: ✭ 90 (-93.27%)
Drugs Recommendation Using Reviews
Analyzing drug descriptions, conditions, and reviews, then recommending drugs for each patient health condition using deep learning models.
Stars: ✭ 35 (-97.38%)
Mutual labels:  jupyter-notebook, data-analysis
Vagrant Projects
Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
Stars: ✭ 34 (-97.46%)
Mutual labels:  ipython, spark