All Projects → Spark → Similar Projects or Alternatives

549 Open source projects that are alternatives of or similar to Spark

databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Stars: ✭ 19 (-65.45%)
Mutual labels:  parquet, spark-sql
Librdkafka
The Apache Kafka C/C++ library
Stars: ✭ 5,617 (+10112.73%)
Mutual labels:  consumer, kafka-producer
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+3029.09%)
Mutual labels:  streaming, spark-sql
Qbusbridge
The Apache Kafka Client SDK
Stars: ✭ 272 (+394.55%)
Mutual labels:  consumer, kafka-producer
albis
Albis: High-Performance File Format for Big Data Systems
Stars: ✭ 20 (-63.64%)
Mutual labels:  parquet, spark-sql
Movies-Analytics-in-Spark-and-Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Stars: ✭ 47 (-14.55%)
Mutual labels:  spark-sql, spark-dataframes
Anotherkafkamonitor Akm
Another app which used to monitor the progress of Kafka Producer and Consumer
Stars: ✭ 36 (-34.55%)
Mutual labels:  consumer, kafka-producer
Flogo
Project Flogo is an open source ecosystem of opinionated event-driven capabilities to simplify building efficient & modern serverless functions, microservices & edge apps.
Stars: ✭ 1,891 (+3338.18%)
Mutual labels:  streaming, kafka-producer
rabbitmq-consumer
A configurable RabbitMQ consumer made in Rust, useful for a stable and reliable CLI commands processor.
Stars: ✭ 25 (-54.55%)
Mutual labels:  consumer
spark2-etl-examples
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
Stars: ✭ 23 (-58.18%)
Mutual labels:  spark-sql
Tweet-Analysis-With-Kafka-and-Spark
A real time analytics dashboard to analyze the trending hashtags and @ mentions at any location using kafka and spark streaming.
Stars: ✭ 18 (-67.27%)
Mutual labels:  spark-sql
standards-maintenance
This repository houses the interactions, consultations and work management to support the maintenance of baselined components of the Consumer Data Right API Standards and Information Security profile.
Stars: ✭ 32 (-41.82%)
Mutual labels:  consumer
sqs-quooler
A complete queue consumer for SQS
Stars: ✭ 23 (-58.18%)
Mutual labels:  consumer
parquet-extra
A collection of Apache Parquet add-on modules
Stars: ✭ 30 (-45.45%)
Mutual labels:  parquet
uvc-streamer
MJPEG webcam network streamer for linux
Stars: ✭ 25 (-54.55%)
Mutual labels:  streaming
odbc2parquet
A command line tool to query an ODBC data source and write the result into a parquet file.
Stars: ✭ 95 (+72.73%)
Mutual labels:  parquet
ember-contextual-services
Services in Ember are scoped to the app as a whole and are singletons. Sometimes you don't want that. :) This addon provides ephemeral route-based services.
Stars: ✭ 20 (-63.64%)
Mutual labels:  consumer
parquet-flinktacular
How to use Parquet in Flink
Stars: ✭ 29 (-47.27%)
Mutual labels:  parquet
qsv
CSVs sliced, diced & analyzed.
Stars: ✭ 438 (+696.36%)
Mutual labels:  parquet
live-cryptocurrency-streaming-flutter
A Flutter app with live cryptocurrency updates, powered by Ably
Stars: ✭ 26 (-52.73%)
Mutual labels:  streaming
DaFlow
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-56.36%)
Mutual labels:  parquet
miniparquet
Library to read a subset of Parquet files
Stars: ✭ 38 (-30.91%)
Mutual labels:  parquet
youtube-dl-nas
youtube download queue websocket server with login for private NAS.
Stars: ✭ 136 (+147.27%)
Mutual labels:  consumer
node-bunnymq
BunnyMQ is an amqp.node wrapper to ease common AMQP usages (RPC, pub/sub, channel/connection handling etc.).
Stars: ✭ 20 (-63.64%)
Mutual labels:  consumer
kafka-proxy
Rust Kafka HTTP proxy
Stars: ✭ 25 (-54.55%)
Mutual labels:  kafka-producer
openmrs-fhir-analytics
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (+0%)
Mutual labels:  parquet
opaque-sql
An encrypted data analytics platform
Stars: ✭ 169 (+207.27%)
Mutual labels:  spark-sql
wow-spark
🔆 spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。
Stars: ✭ 20 (-63.64%)
Mutual labels:  spark-sql
matrixone
Hyperconverged cloud-edge native database
Stars: ✭ 1,057 (+1821.82%)
Mutual labels:  streaming
frizzle
The magic message bus
Stars: ✭ 14 (-74.55%)
Mutual labels:  consumer
dt-sql-parser
SQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (+145.45%)
Mutual labels:  spark-sql
wasp
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-65.45%)
Mutual labels:  parquet
aws-kinesis-consumer
Consume an AWS Kinesis Data Stream to look over the records from a terminal.
Stars: ✭ 23 (-58.18%)
Mutual labels:  consumer
LazyMan-iOS
A simple app that lets you stream every live and archived NHL and MLB game from any of your iOS devices.
Stars: ✭ 73 (+32.73%)
Mutual labels:  streaming
geospark
bring sf to spark in production
Stars: ✭ 53 (-3.64%)
Mutual labels:  spark-sql
hadoop-etl-udfs
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-69.09%)
Mutual labels:  parquet
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (+0%)
Mutual labels:  spark-sql
WWDChrome
Chrome extension which lets you watch WWDC Developer Videos in Google Chrome (thus not having to use Safari)
Stars: ✭ 18 (-67.27%)
Mutual labels:  streaming
messaging-polyglot
RabbitMQ Messaging Polyglot with Java, ColdFusion, CommandBox, Groovy and more
Stars: ✭ 18 (-67.27%)
Mutual labels:  consumer
IMCtermite
Enables extraction of measurement data from binary files with extension 'raw' used by proprietary software imcFAMOS/imcSTUDIO and facilitates its storage in open source file formats
Stars: ✭ 20 (-63.64%)
Mutual labels:  parquet
libdvbtee
dvbtee: a digital television streamer / parser / service information aggregator supporting various interfaces including telnet CLI & http control
Stars: ✭ 65 (+18.18%)
Mutual labels:  streaming
Real-time-Data-Warehouse
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Stars: ✭ 52 (-5.45%)
Mutual labels:  spark-sql
rejected
rejected is a consumer framework for RabbitMQ
Stars: ✭ 56 (+1.82%)
Mutual labels:  consumer
Psi
Platform for Situated Intelligence
Stars: ✭ 249 (+352.73%)
Mutual labels:  streaming
retail-banking
Consumer Banking Application
Stars: ✭ 25 (-54.55%)
Mutual labels:  consumer
php-kafka-lib
PHP Kafka producer / consumer library with PHP Avro support, based on php-rdkafka
Stars: ✭ 38 (-30.91%)
Mutual labels:  consumer
Swadeshi
Implementing a Web Based solution through which farmers can participate in a commodity exchange market
Stars: ✭ 21 (-61.82%)
Mutual labels:  consumer
continuous-analytics-examples
A collection of examples of continuous analytics.
Stars: ✭ 17 (-69.09%)
Mutual labels:  streaming
Topos
🌀 .NET Event Processing library
Stars: ✭ 22 (-60%)
Mutual labels:  kafka-producer
spark-vcf
Spark VCF data source implementation for Dataframes
Stars: ✭ 15 (-72.73%)
Mutual labels:  spark-sql
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (-38.18%)
Mutual labels:  spark-sql
telemetry-streaming
Spark Streaming ETL jobs for Mozilla Telemetry
Stars: ✭ 16 (-70.91%)
Mutual labels:  streaming
Pulsar Client Go
Apache Pulsar Go Client Library
Stars: ✭ 251 (+356.36%)
Mutual labels:  streaming
pulsar-flex
Pulsar Flex is a modern Apache Pulsar client for Node.js, developed to be independent of C++.
Stars: ✭ 43 (-21.82%)
Mutual labels:  consumer
Waveline Server
Simple self-hosted music streaming server
Stars: ✭ 248 (+350.91%)
Mutual labels:  streaming
Betfair
betfairlightweight - python wrapper for Betfair API-NG (with streaming)
Stars: ✭ 246 (+347.27%)
Mutual labels:  streaming
columnify
Make record oriented data to columnar format.
Stars: ✭ 28 (-49.09%)
Mutual labels:  parquet
Pulsar Manager
Apache Pulsar Manager
Stars: ✭ 247 (+349.09%)
Mutual labels:  streaming
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+349.09%)
Mutual labels:  streaming
terraform-aws-kinesis-firehose
This code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.
Stars: ✭ 25 (-54.55%)
Mutual labels:  parquet
1-60 of 549 similar projects