All Projects → Pysparkling → Similar Projects or Alternatives

1030 Open source projects that are alternatives of or similar to Pysparkling

Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-76.19%)
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+78.79%)
Mutual labels:  data-science, apache-spark
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+165.37%)
Mutual labels:  data-science, apache-spark
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+269.7%)
Mutual labels:  data-science, data-processing
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+274.03%)
Mutual labels:  data-science, data-processing
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+1233.77%)
Mutual labels:  data-science, apache-spark
Scalable Data Science
Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational foundations using SageMath.
Stars: ✭ 142 (-38.53%)
Mutual labels:  data-science, apache-spark
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (+71.86%)
Mutual labels:  apache-spark, data-processing
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1632.9%)
Mutual labels:  data-science, data-processing
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-44.59%)
Mutual labels:  data-science, apache-spark
Collapse
Advanced and Fast Data Transformation in R
Stars: ✭ 184 (-20.35%)
Mutual labels:  data-science, data-processing
Spark Workshop
Apache Spark™ and Scala Workshops
Stars: ✭ 224 (-3.03%)
Mutual labels:  apache-spark
Covid19za
Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa
Stars: ✭ 208 (-9.96%)
Mutual labels:  data-science
Eli5
A library for debugging/inspecting machine learning classifiers and explaining their predictions
Stars: ✭ 2,477 (+972.29%)
Mutual labels:  data-science
Scihub
Source code and data analyses for the Sci-Hub Coverage Study
Stars: ✭ 205 (-11.26%)
Mutual labels:  data-science
Streamlit
Streamlit — The fastest way to build data apps in Python
Stars: ✭ 16,906 (+7218.61%)
Mutual labels:  data-science
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (-3.9%)
Mutual labels:  data-processing
Python For Data Science
A collection of Jupyter Notebooks for learning Python for Data Science.
Stars: ✭ 205 (-11.26%)
Mutual labels:  data-science
Estadistica Con R
Apuntes personales sobre estadística, machine learning y lenguaje de programación R
Stars: ✭ 201 (-12.99%)
Mutual labels:  data-science
Ml Workspace
Machine Learning (Beginners Hub), information(courses, books, cheat sheets, live sessions) related to machine learning, data science and python is available
Stars: ✭ 221 (-4.33%)
Mutual labels:  data-science
Lightautoml
LAMA - automatic model creation framework
Stars: ✭ 196 (-15.15%)
Mutual labels:  data-science
Cml
♾️ CML - Continuous Machine Learning | CI/CD for ML
Stars: ✭ 2,843 (+1130.74%)
Mutual labels:  data-science
R4ds Exercise Solutions
Exercise solutions to "R for Data Science"
Stars: ✭ 226 (-2.16%)
Mutual labels:  data-science
Gspread Pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-2.16%)
Mutual labels:  data-science
Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (-6.06%)
Mutual labels:  data-science
Trump Lies
Tutorial: Web scraping in Python with Beautiful Soup
Stars: ✭ 201 (-12.99%)
Mutual labels:  data-science
Cartoframes
CARTO Python package for data scientists
Stars: ✭ 208 (-9.96%)
Mutual labels:  data-science
Awesome Ai Infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
Stars: ✭ 223 (-3.46%)
Mutual labels:  apache-spark
Flaml
A fast and lightweight AutoML library.
Stars: ✭ 205 (-11.26%)
Mutual labels:  data-science
Dash
Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required.
Stars: ✭ 15,592 (+6649.78%)
Mutual labels:  data-science
Compose
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
Stars: ✭ 203 (-12.12%)
Mutual labels:  data-science
Statistical Learning
Lecture Slides and R Sessions for Trevor Hastie and Rob Tibshinari's "Statistical Learning" Stanford course
Stars: ✭ 223 (-3.46%)
Mutual labels:  data-science
Tsfel
An intuitive library to extract features from time series
Stars: ✭ 202 (-12.55%)
Mutual labels:  data-science
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-1.73%)
Mutual labels:  data-science
Laurae
Advanced High Performance Data Science Toolbox for R by Laurae
Stars: ✭ 203 (-12.12%)
Mutual labels:  data-science
Jupyterlab templates
Support for jupyter notebook templates in jupyterlab
Stars: ✭ 223 (-3.46%)
Mutual labels:  data-science
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+1154.98%)
Mutual labels:  apache-spark
Elastic
R client for the Elasticsearch HTTP API
Stars: ✭ 227 (-1.73%)
Mutual labels:  data-science
Instascrape
Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (-12.55%)
Mutual labels:  data-science
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-5.63%)
Mutual labels:  data-science
Fastpages
An easy to use blogging platform, with enhanced support for Jupyter Notebooks.
Stars: ✭ 2,888 (+1150.22%)
Mutual labels:  data-science
Webstruct
NER toolkit for HTML data
Stars: ✭ 230 (-0.43%)
Mutual labels:  data-science
Quinn
pyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (-6.06%)
Mutual labels:  apache-spark
Achoo
Achoo uses a Raspberry Pi to predict if my son will need his inhaler on any given day using weather, pollen, and air quality data. If the prediction for a given day is above a specified threshold, the Pi will email his school nurse, and myself, notifying her that he may need preemptive treatment. Community-sourced health monitoring!
Stars: ✭ 200 (-13.42%)
Mutual labels:  data-science
Lale
Library for Semi-Automated Data Science
Stars: ✭ 198 (-14.29%)
Mutual labels:  data-science
Ml Auto Baseball Pitching Overlay
⚾🤖⚾ Automatic baseball pitching overlay in realtime
Stars: ✭ 200 (-13.42%)
Mutual labels:  data-science
Full Stack Data Science
Full Stack Data Science in Python
Stars: ✭ 227 (-1.73%)
Mutual labels:  data-science
Cardio
CardIO is a library for data science research of heart signals
Stars: ✭ 218 (-5.63%)
Mutual labels:  data-science
Radio
RadIO is a library for data science research of computed tomography imaging
Stars: ✭ 198 (-14.29%)
Mutual labels:  data-science
Pytorch Geometric Yoochoose
This is a tutorial for PyTorch Geometric on the YooChoose dataset
Stars: ✭ 198 (-14.29%)
Mutual labels:  data-science
Chord
Python package for creating beautiful interactive Chord Diagrams. Pro version available at https://m8.fyi/chord
Stars: ✭ 217 (-6.06%)
Mutual labels:  data-science
Data Science Projects With Python
A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn
Stars: ✭ 198 (-14.29%)
Mutual labels:  data-science
Analytics Zoo
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Stars: ✭ 2,448 (+959.74%)
Mutual labels:  apache-spark
Functional intro to python
[tutorial]A functional, Data Science focused introduction to Python
Stars: ✭ 228 (-1.3%)
Mutual labels:  data-science
Machine Learning Resources
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (-2.16%)
Mutual labels:  data-science
Tutorials
AI-related tutorials. Access any of them for free → https://towardsai.net/editorial
Stars: ✭ 204 (-11.69%)
Mutual labels:  data-science
Cql
Categorical Query Language IDE
Stars: ✭ 196 (-15.15%)
Mutual labels:  data-science
Climate Change Data
🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (-15.58%)
Mutual labels:  data-science
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-6.93%)
Mutual labels:  apache-spark
Tad
A desktop application for viewing and analyzing tabular data
Stars: ✭ 2,275 (+884.85%)
Mutual labels:  data-science
1-60 of 1030 similar projects