All Projects → spark3D → Similar Projects or Alternatives

421 Open source projects that are alternatives of or similar to spark3D

learn-by-examples
Real-world Spark pipelines examples
Stars: ✭ 84 (+265.22%)
Mutual labels:  apache-spark, pyspark
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+69.57%)
Mutual labels:  apache-spark, pyspark
mmtf-workshop-2018
Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (+117.39%)
Mutual labels:  apache-spark, pyspark
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+382.61%)
Mutual labels:  apache-spark, pyspark
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (+121.74%)
Mutual labels:  apache-spark, pyspark
pyspark-asyncactions
Asynchronous actions for PySpark
Stars: ✭ 30 (+30.43%)
Mutual labels:  apache-spark, pyspark
Awesome Spark
A curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+4513.04%)
Mutual labels:  apache-spark, pyspark
Quinn
pyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (+843.48%)
Mutual labels:  apache-spark, pyspark
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+14486.96%)
Mutual labels:  apache-spark, pyspark
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (+139.13%)
Mutual labels:  apache-spark, pyspark
mmtf-spark
Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Stars: ✭ 20 (-13.04%)
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+12504.35%)
Mutual labels:  apache-spark, pyspark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+552.17%)
Mutual labels:  apache-spark, pyspark
Pyspark Boilerplate
A boilerplate for writing PySpark Jobs
Stars: ✭ 318 (+1282.61%)
Mutual labels:  apache-spark, pyspark
Spark-for-data-engineers
Apache Spark for data engineers
Stars: ✭ 22 (-4.35%)
Mutual labels:  apache-spark, pyspark
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (+21.74%)
Mutual labels:  apache-spark, pyspark
Pyspark Stubs
Apache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (+326.09%)
Mutual labels:  apache-spark, pyspark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (+617.39%)
Mutual labels:  apache-spark, pyspark
pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+400%)
Mutual labels:  apache-spark, pyspark
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (+239.13%)
Mutual labels:  apache-spark, pyspark
Spark Gotchas
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Stars: ✭ 308 (+1239.13%)
Mutual labels:  apache-spark, pyspark
Live log analyzer spark
Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-39.13%)
Mutual labels:  apache-spark, pyspark
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (+495.65%)
Mutual labels:  apache-spark
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+834.78%)
Mutual labels:  apache-spark
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+7382.61%)
Mutual labels:  apache-spark
Scala Spark Tutorial
Project for James' Apache Spark with Scala course
Stars: ✭ 121 (+426.09%)
Mutual labels:  apache-spark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+508.7%)
Mutual labels:  apache-spark
seamless
Seamless is a framework to set up reproducible computations (and visualizations) that respond to changes in cells. Cells contain the input data as well as the source code of the computations, and all cells can be edited interactively.
Stars: ✭ 19 (-17.39%)
Mutual labels:  scientific-computing
Analytics Zoo
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Stars: ✭ 2,448 (+10543.48%)
Mutual labels:  apache-spark
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (+356.52%)
Mutual labels:  apache-spark
Spark Tpc Ds Performance Test
Use the TPC-DS benchmark to test Spark SQL performance
Stars: ✭ 133 (+478.26%)
Mutual labels:  apache-spark
spark-dgraph-connector
A connector for Apache Spark and PySpark to Dgraph databases.
Stars: ✭ 36 (+56.52%)
Mutual labels:  pyspark
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (+456.52%)
Mutual labels:  apache-spark
Learning Apache Spark
Notes on Apache Spark (pyspark)
Stars: ✭ 211 (+817.39%)
Mutual labels:  apache-spark
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+7639.13%)
Mutual labels:  apache-spark
kdtree
A pure Nim k-d tree implementation for efficient spatial querying of point data
Stars: ✭ 40 (+73.91%)
Mutual labels:  spatial-data
Sparktorch
Train and run Pytorch models on Apache Spark.
Stars: ✭ 195 (+747.83%)
Mutual labels:  apache-spark
Docker Spark
Apache Spark docker image
Stars: ✭ 1,396 (+5969.57%)
Mutual labels:  apache-spark
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (+273.91%)
Mutual labels:  apache-spark
spinmob
Rapid and flexible acquisition, analysis, fitting, and plotting in Python. Designed for scientific laboratories.
Stars: ✭ 34 (+47.83%)
Mutual labels:  scientific-computing
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+669.57%)
Mutual labels:  apache-spark
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (+260.87%)
Mutual labels:  apache-spark
Mlflow
Open source platform for the machine learning lifecycle
Stars: ✭ 10,898 (+47282.61%)
Mutual labels:  apache-spark
Awesome Pulsar
A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (+147.83%)
Mutual labels:  apache-spark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (+139.13%)
Mutual labels:  apache-spark
spark-connector
A connector for Apache Spark to access Exasol
Stars: ✭ 13 (-43.48%)
Mutual labels:  apache-spark
workshop-spark
Código para workshops Spark com ambiente de desenvolvimento em docker
Stars: ✭ 27 (+17.39%)
Mutual labels:  pyspark
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+973.91%)
Mutual labels:  apache-spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (+613.04%)
Mutual labels:  apache-spark
Sparkit Learn
PySpark + Scikit-learn = Sparkit-learn
Stars: ✭ 1,073 (+4565.22%)
Mutual labels:  apache-spark
Spark Atlas Connector
A Spark Atlas connector to track data lineage in Apache Atlas
Stars: ✭ 160 (+595.65%)
Mutual labels:  apache-spark
Spark Nkp
Natural Korean Processor for Apache Spark
Stars: ✭ 50 (+117.39%)
Mutual labels:  apache-spark
Spark Sklearn
(Deprecated) Scikit-learn integration package for Apache Spark
Stars: ✭ 1,055 (+4486.96%)
Mutual labels:  apache-spark
Mastering Spark Sql Book
The Internals of Spark SQL
Stars: ✭ 234 (+917.39%)
Mutual labels:  apache-spark
Cheatsheets.pdf
📚 Various cheatsheets in PDF
Stars: ✭ 159 (+591.3%)
Mutual labels:  apache-spark
Apache Spark Internals
The Internals of Apache Spark
Stars: ✭ 1,045 (+4443.48%)
Mutual labels:  apache-spark
Spark As Service Using Embedded Server
This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server
Stars: ✭ 46 (+100%)
Mutual labels:  apache-spark
Spark Scala Maven Example
Example Maven configuration for a Spark, Scala project
Stars: ✭ 45 (+95.65%)
Mutual labels:  apache-spark
Spark Tda
SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (+95.65%)
Mutual labels:  apache-spark
Detecting-Malicious-URL-Machine-Learning
No description or website provided.
Stars: ✭ 47 (+104.35%)
Mutual labels:  apache-spark
1-60 of 421 similar projects