All Projects → gearpump → Gearpump

gearpump / Gearpump

Licence: apache-2.0
Lightweight real-time big data streaming engine over Akka

Programming Languages

scala
5932 projects

Projects that are alternatives of or similar to Gearpump

data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (-93.29%)
Mutual labels:  bigdata, stream-processing
Hudi Resources
汇总Apache Hudi相关资料
Stars: ✭ 79 (-89.4%)
Mutual labels:  bigdata, stream-processing
Akka Stream Contrib
Add-ons to Akka Stream
Stars: ✭ 173 (-76.78%)
Mutual labels:  stream-processing, akka
Hudi
Upserts, Deletes And Incremental Processing on Big Data.
Stars: ✭ 2,586 (+247.11%)
Mutual labels:  bigdata, stream-processing
Every Single Day I Tldr
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (-66.58%)
Mutual labels:  bigdata, akka
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+258.12%)
Mutual labels:  bigdata, stream-processing
Awesome Bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+1306.44%)
Mutual labels:  bigdata, stream-processing
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 11,093 (+1388.99%)
Mutual labels:  bigdata, stream-processing
Akka-Streams-custom-stream-processing-examples
Demos of how to do custom stream processing using the Akka Streams GraphStages API
Stars: ✭ 13 (-98.26%)
Mutual labels:  akka, stream-processing
Cds
Data syncing in golang for ClickHouse.
Stars: ✭ 501 (-32.75%)
Mutual labels:  bigdata
Faust
Python Stream Processing
Stars: ✭ 5,899 (+691.81%)
Mutual labels:  stream-processing
Stream Json
The micro-library of Node.js stream components for creating custom JSON processing pipelines with a minimal memory footprint. It can parse JSON files far exceeding available memory streaming individual primitives using a SAX-inspired API.
Stars: ✭ 462 (-37.99%)
Mutual labels:  stream-processing
Akka Http Json
Integrate some of the best JSON libs in Scala with Akka HTTP
Stars: ✭ 530 (-28.86%)
Mutual labels:  akka
Spring Cloud Stream
Framework for building Event-Driven Microservices
Stars: ✭ 662 (-11.14%)
Mutual labels:  stream-processing
Bigslice
A serverless cluster computing system for the Go programming language
Stars: ✭ 469 (-37.05%)
Mutual labels:  bigdata
Akka Http Microservice
Example of http (micro)service in Scala & akka-http
Stars: ✭ 701 (-5.91%)
Mutual labels:  akka
Hazelcast
Open-source distributed computation and storage platform
Stars: ✭ 4,662 (+525.77%)
Mutual labels:  stream-processing
Akkadotnet Code Samples
Akka.NET professional reference code samples
Stars: ✭ 451 (-39.46%)
Mutual labels:  akka
Vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+811.81%)
Mutual labels:  bigdata
Akka Essentials
Java/Scala Examples from the book - Akka Essentials
Stars: ✭ 700 (-6.04%)
Mutual labels:  akka

Gearpump

Maven Central GitHub license Build Status codecov.io Join the chat at https://gitter.im/gearpump/gearpump

Gearpump is a lightweight real-time big data streaming engine. It is inspired by recent advances in the Akka framework and a desire to improve on existing streaming frameworks.

The name Gearpump is a reference to the engineering term "gear pump", which is a super simple pump that consists of only two gears, but is very powerful at streaming water.

We model streaming within the Akka actor hierarchy.

Per initial benchmarks we are able to process near 18 million messages/second (100 bytes per message) with a 8ms latency on a 4-node cluster.

For steps to reproduce the performance test, please check Performance benchmark.

Useful Resources

How to Build

1). Clone the Gearpump repository

  git clone https://github.com/gearpump/gearpump.git
  cd gearpump

2). Build package

  ## Please use scala 2.12
  ## The target package path: output/target/gearpump-${version}.zip
  sbt clean +assembly +packArchiveZip

After the build, there will be a package file gearpump-${version}.zip generated under output/target/ folder.

To build scala document, use

   ## Will generate the scala doc under target/scala_2.xx/unidoc/
   sbt unidoc

NOTE: The build requires network connection. If you are behind an enterprise proxy, make sure you have set the proxy in your env before running the build commands. For windows:

set HTTP_PROXY=http://host:port
set HTTPS_PROXY= http://host:port

For Linux:

export HTTP_PROXY=http://host:port
export HTTPS_PROXY=http://host:port

How to do style check before submitting a pull request?

Before submitting a PR, you should always run style check first:

  ## Run style check for compile, test, and integration test.
  sbt scalastyle test:scalastyle it:scalastyle

How to generate the license report to generate a list of all dependencies

  sbt dumpLicenseReport

How to generate dependencies by declared license

  sbt dependencyLicenseInfo

Contributors (time order)

License

Gearpump itself is licensed under the Apache License (2.0). For library it used, please see LICENSE.

Acknowledgement

The netty transport code work is based on Apache Storm. Thanks Apache Storm contributors.

The cgroup code work is based on JStorm. Thanks JStorm contributors.

Thanks to Jetbrains for providing a IntelliJ IDEA Free Open Source License.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].