Awesome KafkaEverything about Apache Kafka
Stars: ✭ 144 (-17.71%)
OryxOryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+920%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1338.86%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+895.43%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-13.14%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-18.86%)
WaterdropWaterDrop is a standalone Karafka component library for generating Kafka messages
Stars: ✭ 136 (-22.29%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-20%)
SparkFirely's open source FHIR server
Stars: ✭ 174 (-0.57%)
SparkmonitorMonitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-12%)
Sparkling GraphSparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.
Stars: ✭ 139 (-20.57%)
Syslog GollectorSyslog Collector written in Go, streams to Kafka 0.8
Stars: ✭ 138 (-21.14%)
RedpandaRedpanda is the real-time engine for modern apps. Kafka API Compatible; 10x faster 🚀 See more at vectorized.io/redpanda
Stars: ✭ 3,114 (+1679.43%)
Kafka Connect Mongodb**Unofficial / Community** Kafka Connect MongoDB Sink Connector - Find the official MongoDB Kafka Connector here: https://www.mongodb.com/kafka-connector
Stars: ✭ 137 (-21.71%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for Keras, TensorFlow, and ONNX/PyTorch, a modular and tiny C++ library for running math code, and a Java-based math library on top of the core C++ library. Also includes SameDiff: a PyTorch/TensorFlow-like library for running deep learni…
Stars: ✭ 12,277 (+6915.43%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (-12.57%)
Whylogs JavaProfile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-6.29%)
Apache Spark NodeNode.js bindings for Apache Spark DataFrame APIs
Stars: ✭ 136 (-22.29%)
SecorSecor is a service implementing Kafka log persistence
Stars: ✭ 1,728 (+887.43%)
HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+6724.57%)
PowderkegLive-coding the cluster!
Stars: ✭ 152 (-13.14%)
GokaGoka is a compact yet powerful distributed stream processing library for Apache Kafka written in Go.
Stars: ✭ 1,862 (+964%)
Node RdkafkaNode.js bindings for librdkafka
Stars: ✭ 1,799 (+928%)
Kraps RpcAn RPC framework leveraging the Spark RPC module
Stars: ✭ 175 (+0%)
Sdk JavaJava SDK for CloudEvents
Stars: ✭ 173 (-1.14%)
Rdkafka RubyModern and performant Kafka client library for Ruby based on librdkafka
Stars: ✭ 152 (-13.14%)
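Echo🦄 Open-source community system built on SpringBoot + MyBatis + MySQL + Redis + Kafka + Elasticsearch + Spring Security + ..., with detailed development documentation and accompanying tutorials. Includes modules for posts, comments, private messages, system notifications, likes, follows, search, user settings, and data statistics.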
Stars: ✭ 129 (-26.29%)
StreamlineStreamLine - Streaming Analytics
Stars: ✭ 151 (-13.71%)
SamsaraSamsara is a real-time analytics platform
Stars: ✭ 132 (-24.57%)
Dcos CommonsDC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
Stars: ✭ 162 (-7.43%)
Spark TsneDistributed t-SNE via Apache Spark
Stars: ✭ 151 (-13.71%)
Components ContribCommunity driven, reusable components for distributed apps
Stars: ✭ 131 (-25.14%)
Metronome Metronome is a distributed and fault-tolerant event scheduler
Stars: ✭ 131 (-25.14%)
AkhqKafka GUI for Apache Kafka to manage topics, topic data, consumer groups, schema registry, Connect, and more...
Stars: ✭ 2,195 (+1154.29%)
Kafka JunitThis library wraps Kafka's embedded test cluster, allowing you to more easily create and run integration tests using JUnit against a "real" Kafka server running within the context of your tests. No need to stand up an external Kafka cluster!
Stars: ✭ 131 (-25.14%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+1090.86%)
LinkisLinkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...) and exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+1227.43%)
OpaqueAn encrypted data analytics platform
Stars: ✭ 129 (-26.29%)
Spylon KernelJupyter kernel for Scala and Spark
Stars: ✭ 129 (-26.29%)
Benchm MlA minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+948.57%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+883.43%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+838.29%)
KopKafka-on-Pulsar - A protocol handler that brings native Kafka protocol to Apache Pulsar
Stars: ✭ 159 (-9.14%)
AztkAZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-13.14%)
Scrapy demoAll kinds of Scrapy demos
Stars: ✭ 128 (-26.86%)
Airflow PipelineAn Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-26.86%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-14.29%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+1372%)