450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...

Stars: ✭ 1,000 (+458.66%)

Mutual labels: kafka, hbase

Spring Boot Quick

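🌿 Quick-start learning examples based on Spring Boot, integrating open-source frameworks the author has encountered, such as RabbitMQ (delayed queues), Kafka, JPA, Redis, OAuth2, Swagger, JSP, Docker, spring-batch, exception handling, log output, multi-module development, multi-environment packaging, caching, web crawlers, JWT, GraphQL, Dubbo, ZooKeeper, Async, and more 📌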

Stars: ✭ 1,819 (+916.2%)

Mutual labels: spark, hbase

Hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Stars: ✭ 126 (-29.61%)

Mutual labels: spark, flink

Gaffer

A large-scale entity and relation database supporting aggregation of properties

Stars: ✭ 1,642 (+817.32%)

Mutual labels: spark, hbase

Abris

Avro SerDe for Apache Spark structured APIs.

Stars: ✭ 130 (-27.37%)

Mutual labels: kafka, spark

Bigdata Playground

A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL

Stars: ✭ 177 (-1.12%)

Mutual labels: kafka, hbase

Registry

Schema Registry

Stars: ✭ 184 (+2.79%)

Mutual labels: kafka, flink

Seldon Server

Machine Learning Platform and Recommendation Engine built on Kubernetes

Stars: ✭ 1,435 (+701.68%)

Mutual labels: kafka, spark

Data Accelerator

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

Stars: ✭ 247 (+37.99%)

Mutual labels: kafka, spark

Spiderman

A general-purpose distributed crawler framework based on scrapy-redis

Stars: ✭ 392 (+118.99%)

Mutual labels: kafka, hbase

Hadoop cookbook

Cookbook to install Hadoop 2.0+ using Chef

Stars: ✭ 82 (-54.19%)

Mutual labels: spark, hbase

Example Spark Kafka

Apache Spark and Apache Kafka integration example

Stars: ✭ 120 (-32.96%)

Mutual labels: kafka, spark

Streamline

StreamLine - Streaming Analytics

Stars: ✭ 151 (-15.64%)

Mutual labels: kafka, flink

Spark Kafka Writer

Write your Spark data to Kafka seamlessly

Stars: ✭ 175 (-2.23%)

Mutual labels: kafka, spark

Spark Ml Source Analysis

An analysis of the principles behind Spark ML algorithms and of their concrete source code implementations

Stars: ✭ 1,873 (+946.37%)

Mutual labels: spark

Benchm Ml

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).

Stars: ✭ 1,835 (+925.14%)

Mutual labels: spark

Aztk

AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure

Stars: ✭ 152 (-15.08%)

Mutual labels: spark

Mirus

Mirus is a cross data-center data replication tool for Apache Kafka

Stars: ✭ 171 (-4.47%)

Mutual labels: kafka

Whylogs Java

Profile and monitor your ML data pipeline end-to-end

Stars: ✭ 164 (-8.38%)

Mutual labels: spark

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-16.2%)

Mutual labels: spark

Cc Pyspark

Process Common Crawl data with Python and Spark

Stars: ✭ 147 (-17.88%)

Mutual labels: spark

Netty Learning Example

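🥚 Netty hands-on learning examples: see the big picture from small details! Bring your heart and follow the tutorial. I believe you can do it.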

Stars: ✭ 2,146 (+1098.88%)

Mutual labels: kafka

Phpkafka

A PHP Kafka client for use with PHP-FPM and Swoole. It supports 50 APIs, which may make it one of the clients supporting the most message types.

Stars: ✭ 149 (-16.76%)

Mutual labels: kafka

Fero

light, fast, scalable, streaming microservices made easy

Stars: ✭ 175 (-2.23%)

Mutual labels: kafka

Deeplearning4j

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…

Stars: ✭ 12,277 (+6758.66%)

Mutual labels: spark

Dcos Commons

DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.

Stars: ✭ 162 (-9.5%)

Mutual labels: kafka

Kafka Eagle

An easy-to-use, high-performance monitoring system for comprehensive monitoring and management of Kafka clusters.

Stars: ✭ 2,240 (+1151.4%)

Mutual labels: kafka

Kcli

A kafka command line browser

Stars: ✭ 148 (-17.32%)

Mutual labels: kafka

Linkis

Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,323 (+1197.77%)

Mutual labels: spark

A Kafka Story

Kafka ecosystem ... but step by step!

Stars: ✭ 148 (-17.32%)

Mutual labels: kafka

Azkarra Streams

🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.

Stars: ✭ 146 (-18.44%)

Mutual labels: kafka

Logi Kafkamanager

A one-stop platform for Apache Kafka cluster metrics monitoring and operations management

Stars: ✭ 3,280 (+1732.4%)

Mutual labels: kafka

Kop

Kafka-on-Pulsar - A protocol handler that brings native Kafka protocol to Apache Pulsar

Stars: ✭ 159 (-11.17%)

Mutual labels: kafka

Kafkajs

A modern Apache Kafka client for node.js

Stars: ✭ 2,315 (+1193.3%)

Mutual labels: kafka

Camellia

Camellia framework by netease-im. Provides: 1) redis-client; 2) redis-proxy (redis-sentinel/redis-cluster); 3) hbase-client; 4) others

Stars: ✭ 146 (-18.44%)

Mutual labels: hbase

Vue Info Card

Simple and beautiful card component with an elegant spark line, for VueJS.

Stars: ✭ 159 (-11.17%)

Mutual labels: spark

Pyspark Learning

Updated repository

Stars: ✭ 147 (-17.88%)

Mutual labels: spark

Datacompy

Pandas and Spark DataFrame comparison for humans

Stars: ✭ 147 (-17.88%)

Mutual labels: spark

Kafka Book

Code for the book 《Kafka技术内幕》 (Kafka Technology Internals)

Stars: ✭ 175 (-2.23%)

Mutual labels: kafka

Kraps Rpc

An RPC framework leveraging the Spark RPC module

Stars: ✭ 175 (-2.23%)

Mutual labels: spark

Transmogrifai

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Stars: ✭ 2,084 (+1064.25%)

Mutual labels: spark

Event Sourcing Jambo

A Hexagonal Architecture with DDD + Aggregates + Event Sourcing using .NET Core, Kafka and MongoDB (Blog Engine)

Stars: ✭ 159 (-11.17%)

Mutual labels: kafka

Supersafebank

Sample Event Sourcing implementation with .NET Core

Stars: ✭ 142 (-20.67%)

Mutual labels: kafka

Glow

An open-source toolkit for large-scale genomic analysis