All Projects → spark-utils → Similar Projects or Alternatives

180 Open source projects that are alternatives of or similar to spark-utils

Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+3616%)
Mutual labels:  apache-spark, spark-streaming
Coolplayspark
酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+13172%)
Mutual labels:  apache-spark, spark-streaming
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+460%)
Mutual labels:  apache-spark, spark-streaming
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (+232%)
Mutual labels:  apache-spark, spark-streaming
Streaming Readings
Streaming System 相关的论文读物
Stars: ✭ 554 (+2116%)
Mutual labels:  apache-spark, spark-streaming
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+888%)
Mutual labels:  apache-spark, spark-streaming
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (+48%)
Mutual labels:  apache-spark, spark-streaming
Spark Streaming Monitoring With Lightning
Plot live-stats as graph from ApacheSpark application using Lightning-viz
Stars: ✭ 15 (-40%)
Mutual labels:  apache-spark, spark-streaming
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+6784%)
Mutual labels:  apache-spark, spark-streaming
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+608%)
Mutual labels:  apache-spark, spark-streaming
learning-hadoop-and-spark
Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+484%)
Mutual labels:  apache-spark
osm-parquetizer
A converter for the OSM PBFs to Parquet files
Stars: ✭ 71 (+184%)
Mutual labels:  apache-spark
BigInsights-on-Apache-Hadoop
Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix
Stars: ✭ 21 (-16%)
Mutual labels:  spark-streaming
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (+212%)
Mutual labels:  apache-spark
awesome-tools
curated list of awesome tools and libraries for specific domains
Stars: ✭ 31 (+24%)
Mutual labels:  apache-spark
bitnami-docker-spark
Bitnami Docker Image for Apache Spark
Stars: ✭ 239 (+856%)
Mutual labels:  spark-streaming
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (+120%)
Mutual labels:  apache-spark
learn-by-examples
Real-world Spark pipelines examples
Stars: ✭ 84 (+236%)
Mutual labels:  apache-spark
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+56%)
Mutual labels:  apache-spark
mmtf-spark
Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Stars: ✭ 20 (-20%)
Mutual labels:  apache-spark
fink-broker
Astronomy Broker based on Apache Spark
Stars: ✭ 18 (-28%)
Mutual labels:  apache-spark
DaFlow
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-4%)
Mutual labels:  apache-spark
wasp
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-24%)
Mutual labels:  spark-streaming
qs-hadoop
大数据生态圈学习
Stars: ✭ 18 (-28%)
Mutual labels:  spark-streaming
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (+12%)
Mutual labels:  apache-spark
ExDeMon
A general purpose metrics monitor implemented with Apache Spark. Kafka source, Elastic sink, aggregate metrics, different analysis, notifications, actions, live configuration update, missing metrics, ...
Stars: ✭ 19 (-24%)
Mutual labels:  spark-streaming
SANSA-Stack
Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Stars: ✭ 130 (+420%)
Mutual labels:  apache-spark
BigCLAM-ApacheSpark
Overlapping community detection in Large-Scale Networks using BigCLAM model build on Apache Spark
Stars: ✭ 40 (+60%)
Mutual labels:  apache-spark
sparklygraphs
Old repo for R interface for GraphFrames
Stars: ✭ 13 (-48%)
Mutual labels:  apache-spark
hyperdrive
Extensible streaming ingestion pipeline on top of Apache Spark
Stars: ✭ 31 (+24%)
Mutual labels:  apache-spark
proxima-platform
The Proxima platform.
Stars: ✭ 17 (-32%)
Mutual labels:  apache-spark
net.jgp.books.spark.ch07
Spark in Action, 2nd edition - chapter 7 - Ingestion from files
Stars: ✭ 13 (-48%)
Mutual labels:  apache-spark
Spark ALS
基于spark-ml,spark-mllib,spark-streaming的推荐算法实现
Stars: ✭ 89 (+256%)
Mutual labels:  spark-streaming
SparkTwitterAnalysis
An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project.
Stars: ✭ 29 (+16%)
Mutual labels:  apache-spark
streamsx.kafka
Repository for integration with Apache Kafka
Stars: ✭ 13 (-48%)
Mutual labels:  apache-spark
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (+104%)
Mutual labels:  apache-spark
Location-based-Restaurants-Recommendation-System
Big Data Management and Analysis Final Project
Stars: ✭ 44 (+76%)
Mutual labels:  apache-spark
wakib-keys
Emacs mode that moves to modern keybindings
Stars: ✭ 31 (+24%)
Mutual labels:  convenience
Real-time-log-analysis-system
🐧基于spark streaming+flume+kafka+hbase的实时日志处理分析系统(分为控制台版本和基于springboot、Echarts等的Web UI可视化版本)
Stars: ✭ 31 (+24%)
Mutual labels:  spark-streaming
gan deeplearning4j
Automatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-24%)
Mutual labels:  apache-spark
fdp-modelserver
An umbrella project for multiple implementations of model serving
Stars: ✭ 47 (+88%)
Mutual labels:  spark-streaming
parquet-dotnet
🐬 Apache Parquet for modern .Net
Stars: ✭ 199 (+696%)
Mutual labels:  apache-spark
spark3D
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Stars: ✭ 23 (-8%)
Mutual labels:  apache-spark
plasma-docker
Plasmoid for KDE Plasma to control docker containers
Stars: ✭ 38 (+52%)
Mutual labels:  convenience
cloud-integration
Spark cloud integration: tests, cloud committers and more
Stars: ✭ 20 (-20%)
Mutual labels:  apache-spark
spark-connector
A connector for Apache Spark to access Exasol
Stars: ✭ 13 (-48%)
Mutual labels:  apache-spark
interview-refresh-java-bigdata
a one-stop repo to lookup for code snippets of core java concepts, sql, data structures as well as big data. It also consists of interview questions asked in real-life.
Stars: ✭ 25 (+0%)
Mutual labels:  spark-streaming
Detecting-Malicious-URL-Machine-Learning
No description or website provided.
Stars: ✭ 47 (+88%)
Mutual labels:  apache-spark
seatunnel-example
seatunnel plugin developing examples.
Stars: ✭ 27 (+8%)
Mutual labels:  spark-streaming
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (+36%)
Mutual labels:  spark-streaming
Mastering Spark Sql Book
The Internals of Spark SQL
Stars: ✭ 234 (+836%)
Mutual labels:  apache-spark
ZstdFortranLib
👨‍💻Zaak's 🧩(missing) 🏛Standard 🔬Fortran 📚Library 🚧(WIP)
Stars: ✭ 17 (-32%)
Mutual labels:  convenience
sparkucx
A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer
Stars: ✭ 32 (+28%)
Mutual labels:  apache-spark
open-stream-processing-benchmark
This repository contains the code base for the Open Stream Processing Benchmark.
Stars: ✭ 37 (+48%)
Mutual labels:  spark-streaming
Pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+824%)
Mutual labels:  apache-spark
Awesome Ai Infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
Stars: ✭ 223 (+792%)
Mutual labels:  apache-spark
T-Watch
Real Time Twitter Sentiment Analysis Product
Stars: ✭ 20 (-20%)
Mutual labels:  spark-streaming
Spark Workshop
Apache Spark™ and Scala Workshops
Stars: ✭ 224 (+796%)
Mutual labels:  apache-spark
Quinn
pyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (+768%)
Mutual labels:  apache-spark
xxhadoop
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (+48%)
Mutual labels:  spark-streaming
1-60 of 180 similar projects