All Projects → pyspark-asyncactions → Similar Projects or Alternatives

201 Open source projects that are alternatives of or similar to pyspark-asyncactions

spark
Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apache/spark/
Stars: ✭ 609 (+1930%)
Mutual labels:  apache-spark
proxima-platform
The Proxima platform.
Stars: ✭ 17 (-43.33%)
Mutual labels:  apache-spark
machine-learning-course
Machine Learning Course @ Santa Clara University
Stars: ✭ 17 (-43.33%)
Mutual labels:  pyspark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+366.67%)
Mutual labels:  apache-spark
connected-component
Map Reduce Implementation of Connected Component on Apache Spark
Stars: ✭ 68 (+126.67%)
Mutual labels:  apache-spark
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (+93.33%)
Mutual labels:  pyspark
SparkTwitterAnalysis
An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project.
Stars: ✭ 29 (-3.33%)
Mutual labels:  apache-spark
fink-broker
Astronomy Broker based on Apache Spark
Stars: ✭ 18 (-40%)
Mutual labels:  apache-spark
DataEngineering
This repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (+56.67%)
Mutual labels:  pyspark
oshinko-s2i
This is a place to put s2i images and utilities for spark application builders for openshift
Stars: ✭ 16 (-46.67%)
Mutual labels:  pyspark
Spark Tpc Ds Performance Test
Use the TPC-DS benchmark to test Spark SQL performance
Stars: ✭ 133 (+343.33%)
Mutual labels:  apache-spark
hyperdrive
Extensible streaming ingestion pipeline on top of Apache Spark
Stars: ✭ 31 (+3.33%)
Mutual labels:  apache-spark
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (+326.67%)
Mutual labels:  apache-spark
spark-connector
A connector for Apache Spark to access Exasol
Stars: ✭ 13 (-56.67%)
Mutual labels:  apache-spark
phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
Stars: ✭ 115 (+283.33%)
Mutual labels:  pyspark
Detecting-Malicious-URL-Machine-Learning
No description or website provided.
Stars: ✭ 47 (+56.67%)
Mutual labels:  apache-spark
spark-transformers
Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies.
Stars: ✭ 39 (+30%)
Mutual labels:  apache-spark
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+723.33%)
Mutual labels:  apache-spark
cloud-integration
Spark cloud integration: tests, cloud committers and more
Stars: ✭ 20 (-33.33%)
Mutual labels:  apache-spark
Pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+670%)
Mutual labels:  apache-spark
ai-deployment
关注AI模型上线、模型部署
Stars: ✭ 149 (+396.67%)
Mutual labels:  pyspark
databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Stars: ✭ 19 (-36.67%)
Mutual labels:  pyspark
Scala Spark Tutorial
Project for James' Apache Spark with Scala course
Stars: ✭ 121 (+303.33%)
Mutual labels:  apache-spark
pulsar-adapters
Apache Pulsar Adapters
Stars: ✭ 18 (-40%)
Mutual labels:  apache-spark
gan deeplearning4j
Automatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-36.67%)
Mutual labels:  apache-spark
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+5833.33%)
Mutual labels:  apache-spark
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+616.67%)
Mutual labels:  apache-spark
Springboard-Data-Science-Immersive
No description or website provided.
Stars: ✭ 52 (+73.33%)
Mutual labels:  pyspark
Analytics Zoo
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Stars: ✭ 2,448 (+8060%)
Mutual labels:  apache-spark
spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
Stars: ✭ 67 (+123.33%)
Mutual labels:  apache-spark
spark-streaming-visualize
Simple demonstration of how to build a complex real time machine learning visualization tool.
Stars: ✭ 16 (-46.67%)
Mutual labels:  apache-spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (+446.67%)
Mutual labels:  apache-spark
BigCLAM-ApacheSpark
Overlapping community detection in Large-Scale Networks using BigCLAM model build on Apache Spark
Stars: ✭ 40 (+33.33%)
Mutual labels:  apache-spark
net.jgp.books.spark.ch01
Spark in Action, 2nd edition - chapter 1 - Introduction
Stars: ✭ 72 (+140%)
Mutual labels:  apache-spark
Albedo
A recommender system for discovering GitHub repos, built with Apache Spark
Stars: ✭ 149 (+396.67%)
Mutual labels:  apache-spark
net.jgp.books.spark.ch07
Spark in Action, 2nd edition - chapter 7 - Ingestion from files
Stars: ✭ 13 (-56.67%)
Mutual labels:  apache-spark
Oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+5850%)
Mutual labels:  apache-spark
ODSC India 2018
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-13.33%)
Mutual labels:  pyspark
Scalable Data Science
Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational foundations using SageMath.
Stars: ✭ 142 (+373.33%)
Mutual labels:  apache-spark
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (+356.67%)
Mutual labels:  apache-spark
spark-sql-internals
The Internals of Spark SQL
Stars: ✭ 331 (+1003.33%)
Mutual labels:  apache-spark
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+5636.67%)
Mutual labels:  apache-spark
Docker Spark
Apache Spark docker image
Stars: ✭ 1,396 (+4553.33%)
Mutual labels:  apache-spark
Spark-and-Kafka IoT-Data-Processing-and-Analytics
Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time
Stars: ✭ 42 (+40%)
Mutual labels:  pyspark
flask-spark-docker
Just a boilerplate for PySpark and Flask
Stars: ✭ 32 (+6.67%)
Mutual labels:  pyspark
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (+186.67%)
Mutual labels:  apache-spark
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (+250%)
Mutual labels:  apache-spark
anovos
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (+156.67%)
Mutual labels:  pyspark
PysparkCheatsheet
PySpark Cheatsheet
Stars: ✭ 25 (-16.67%)
Mutual labels:  apache-spark
OSCI
Open Source Contributor Index
Stars: ✭ 107 (+256.67%)
Mutual labels:  pyspark
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (+176.67%)
Mutual labels:  apache-spark
Mlflow
Open source platform for the machine learning lifecycle
Stars: ✭ 10,898 (+36226.67%)
Mutual labels:  apache-spark
Awesome Pulsar
A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (+90%)
Mutual labels:  apache-spark
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+13.33%)
Mutual labels:  pyspark
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-16.67%)
Mutual labels:  pyspark
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+140%)
Mutual labels:  pyspark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (+83.33%)
Mutual labels:  apache-spark
Sparkit Learn
PySpark + Scikit-learn = Sparkit-learn
Stars: ✭ 1,073 (+3476.67%)
Mutual labels:  apache-spark
kafka-twitter-spark-streaming
Counting Tweets Per User in Real-Time
Stars: ✭ 38 (+26.67%)
Mutual labels:  pyspark
geospark
bring sf to spark in production
Stars: ✭ 53 (+76.67%)
Mutual labels:  apache-spark
61-120 of 201 similar projects