HeraclesHigh performance HBase / Spark SQL engine
Stars: ✭ 27 (-52.63%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+55370.18%)
BurrowxKafka consumer lag monitor
Stars: ✭ 50 (-12.28%)
GatkOfficial code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+1657.89%)
Aidp weiboAd Infrastructure Data Processor : kafka consumer embedded Lua scripting language in data process framework
Stars: ✭ 20 (-64.91%)
FlintA Time Series Library for Apache Spark
Stars: ✭ 878 (+1440.35%)
Connection Pool Client💥 A simple multi-purpose connection pool client (Kafka & Hbase & Redis & RMDB & Socket & Http)
Stars: ✭ 40 (-29.82%)
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-75.44%)
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-75.44%)
Docker HadoopA Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-5.26%)
RafkaKafka proxy with a simple API, speaking the Redis protocol
Stars: ✭ 49 (-14.04%)
Storm Dynamic SpoutA framework for building spouts for Apache Storm and a Kafka based spout for dynamically skipping messages to be processed later.
Stars: ✭ 40 (-29.82%)
SaramaSarama is a Go library for Apache Kafka 0.8, and up.
Stars: ✭ 7,964 (+13871.93%)
KafkatoolsCLI tools for monitoring and managing Apache Kafka
Stars: ✭ 13 (-77.19%)
UrhoxUrho3D extension library
Stars: ✭ 13 (-77.19%)
Nagios Plugins450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+1654.39%)
Sparkling TitanicTraining models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-78.95%)
MlfeatureFeature engineering toolkit for Spark MLlib.
Stars: ✭ 12 (-78.95%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (+1650.88%)
Javaok必看!java后端,亮剑诛仙。java发展路线技术要点。
Stars: ✭ 867 (+1421.05%)
Ssm销售系统项目,spring+spring mvc+mybatis+dubbo+kafka+redis+maven
Stars: ✭ 55 (-3.51%)
SpringbootSpringBoot 整合各类框架和应用
Stars: ✭ 54 (-5.26%)
Ruby KafkaA Ruby client library for Apache Kafka
Stars: ✭ 1,039 (+1722.81%)
Python Kafka ElasticsearchSimple learning project pushing CSV data into Kafka then indexing the data in ElasticSearch
Stars: ✭ 11 (-80.7%)
Rom KafkaApache Kafka support for Ruby Object Mapper
Stars: ✭ 11 (-80.7%)
MareMaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-80.7%)
Go Kafka AvroA library provides consumer/producer to work with kafka, avro and schema registry
Stars: ✭ 39 (-31.58%)
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-80.7%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (+1400%)
SnappydataProject SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Stars: ✭ 995 (+1645.61%)
KudoKubernetes Universal Declarative Operator (KUDO)
Stars: ✭ 849 (+1389.47%)
Tiledb VcfEfficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-54.39%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+1777.19%)
KowlApache Kafka Web UI for exploring messages, consumers, configurations and more with a focus on a good UI & UX.
Stars: ✭ 1,036 (+1717.54%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1629.82%)
Spark SwaggerSpark (http://sparkjava.com/) support for Swagger (https://swagger.io/)
Stars: ✭ 25 (-56.14%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+1529.82%)
ChroniclerScala toolchain for InfluxDB
Stars: ✭ 24 (-57.89%)
ScaleAnother example of a REST API with Akka HTTP
Stars: ✭ 23 (-59.65%)
DigitrecognizerJava Convolutional Neural Network example for Hand Writing Digit Recognition
Stars: ✭ 23 (-59.65%)
Foundationdb4sType-safe and idiomatic Scala client for FoundationDB
Stars: ✭ 23 (-59.65%)
Zaneperfor前端性能监控系统,消息队列,高可用,集群等相关架构
Stars: ✭ 1,085 (+1803.51%)
PretendyourexyzzyA web clone of the card game Cards Against Humanity.
Stars: ✭ 1,069 (+1775.44%)
BdsBlockchain data parsing and persisting results
Stars: ✭ 1,032 (+1710.53%)
Scrapy ClusterThis Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
Stars: ✭ 921 (+1515.79%)
KyloKylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+1507.02%)