Spark Jupyter AwsA guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (+12.12%)
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-93.94%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+312.99%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (+332.03%)
HelkThe Hunting ELK
Stars: ✭ 3,097 (+1240.69%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+222.51%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+479.22%)
Cs231nMy Solution to Assignments of CS231n in Winter2016
Stars: ✭ 71 (-69.26%)
MydatascienceportfolioApplying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-1.73%)
Spark Nlp ModelsModels and Pipelines for the Spark NLP library
Stars: ✭ 88 (-61.9%)
Data science blogsA repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-39.83%)
Ml Workspace🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+911.69%)
JustenoughscalaforsparkA tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+132.9%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+2348.48%)
Crime AnalysisAssociation Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-91.34%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-72.29%)
Starter Academic🎓 Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify
Stars: ✭ 1,158 (+401.3%)
Big Data🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-59.74%)
Hops ExamplesExamples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-63.64%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-31.6%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-35.06%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-74.89%)
AlmondA Scala kernel for Jupyter
Stars: ✭ 1,354 (+486.15%)
Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-72.73%)
JupytextJupyter Notebooks as Markdown Documents, Julia, Python or R scripts
Stars: ✭ 4,969 (+2051.08%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+78.79%)
Enterprise gatewayA lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
Stars: ✭ 412 (+78.35%)
ZatZeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+31.17%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+326.84%)
Python BigdataData science and Big Data with Python
Stars: ✭ 112 (-51.52%)
Repo 2018Deep Learning Summer School + Tensorflow + OpenCV cascade training + YOLO + COCO + CycleGAN + AWS EC2 Setup + AWS IoT Project + AWS SageMaker + AWS API Gateway + Raspberry Pi3 Ubuntu Core
Stars: ✭ 163 (-29.44%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-13.42%)
AlphatoolsQuantitative finance research tools in Python
Stars: ✭ 226 (-2.16%)
1833518.335 - Introduction to Numerical Methods course
Stars: ✭ 228 (-1.3%)
Gpt2botYour new Telegram buddy powered by transformers
Stars: ✭ 228 (-1.3%)
Applied Reinforcement LearningReinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
Stars: ✭ 229 (-0.87%)
Pydata BookMaterials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
Stars: ✭ 16,386 (+6993.51%)
Neural Network From ScratchEver wondered how to code your Neural Network using NumPy, with no frameworks involved?
Stars: ✭ 230 (-0.43%)
StructuredinferenceStructured Inference Networks for Nonlinear State Space Models
Stars: ✭ 230 (-0.43%)
DataData and code behind the articles and graphics at FiveThirtyEight
Stars: ✭ 15,241 (+6497.84%)