Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-18.33%)
ScramjetSimple yet powerful live data computation framework
Stars: ✭ 171 (-5%)
PhpkafkaPHP Kafka client is used in PHP-FPM and Swoole. PHP Kafka client supports 50 APIs, which might be one that supports the most message types ever.
Stars: ✭ 149 (-17.22%)
Dcos CommonsDC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
Stars: ✭ 162 (-10%)
Kafka EagleA easy and high-performance monitoring system, for comprehensive monitoring and management of kafka cluster.
Stars: ✭ 2,240 (+1144.44%)
KcliA kafka command line browser
Stars: ✭ 148 (-17.78%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-2.22%)
JkesA search framework and multi-tenant search platform based on java, kafka, kafka connect, elasticsearch
Stars: ✭ 173 (-3.89%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+1190.56%)
A Kafka StoryKafka ecosystem ... but step by step!
Stars: ✭ 148 (-17.78%)
Azkarra Streams🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
Stars: ✭ 146 (-18.89%)
KopKafka-on-Pulsar - A protocol handler that brings native Kafka protocol to Apache Pulsar
Stars: ✭ 159 (-11.67%)
KafkajsA modern Apache Kafka client for node.js
Stars: ✭ 2,315 (+1186.11%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1298.89%)
Vue Info CardSimple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (-11.67%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-18.33%)
SupersafebankSample Event Sourcing implementation with .NET Core
Stars: ✭ 142 (-21.11%)
Event Sourcing JamboAn Hexagonal Architecture with DDD + Aggregates + Event Sourcing using .NET Core, Kafka e MongoDB (Blog Engine)
Stars: ✭ 159 (-11.67%)
PhobosSimplifying Kafka for ruby apps
Stars: ✭ 176 (-2.22%)
MirusMirus is a cross data-center data replication tool for Apache Kafka
Stars: ✭ 171 (-5%)
GlowAn open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-11.67%)
OryxOryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+891.67%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+6720.56%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-12.22%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+867.78%)
Spark AuthorizerA Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-21.67%)
Queue本人的RabbitMQ和Kafka详细笔记以及示例代码
Stars: ✭ 158 (-12.22%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-21.11%)
My MomentsInstagram Clone - Cloning Instagram for learning purpose
Stars: ✭ 140 (-22.22%)
Ferolight, fast, scalable, streaming microservices made easy
Stars: ✭ 175 (-2.78%)
Enqueue DevMessage Queue, Job Queue, Broadcasting, WebSockets packages for PHP, Symfony, Laravel, Magento. DEVELOPMENT REPOSITORY - provided by Forma-Pro
Stars: ✭ 1,977 (+998.33%)
Data science blogsA repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-22.78%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-22.22%)
Kafka RestConfluent REST Proxy for Kafka
Stars: ✭ 1,863 (+935%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+1057.78%)
Spring Boot Vue Bank我,请始皇[打钱]是一个前后端分离的工具人系统,项目采用 SpringBoot+Go+Vue 开发,项目加入常见的企业级应用所涉及到的技术点,例如 Redis、RabbitMQ 等(主要是多用用工具多踩踩坑)。
Stars: ✭ 157 (-12.78%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-15.56%)
Sparkling GraphSparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (-22.78%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-2.78%)
Isolation ForestA Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-22.78%)
Laravel QueueLaravel Enqueue message queue extension. Supports AMQP, Amazon SQS, Kafka, Google PubSub, Redis, STOMP, Gearman, Beanstalk and others
Stars: ✭ 155 (-13.89%)
Syslog GollectorSyslog Collector written in Go, streams to Kafka 0.8
Stars: ✭ 138 (-23.33%)
Kafka Connect Mongodb**Unofficial / Community** Kafka Connect MongoDB Sink Connector - Find the official MongoDB Kafka Connector here: https://www.mongodb.com/kafka-connector
Stars: ✭ 137 (-23.89%)
QuicksqlA Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+911.67%)
Parallel ConsumerParallel Apache Kafka client wrapper with client side queueing, a simpler consumer/producer API with key concurrency and extendable non-blocking IO processing.
Stars: ✭ 154 (-14.44%)
WaterdropWaterDrop is a standalone Karafka component library for generating Kafka messages
Stars: ✭ 136 (-24.44%)
SparkmonitorMonitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-14.44%)