Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-98.67%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-97.66%)
Kafka UiOpen-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (-97.82%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-98.32%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-97.95%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (-91.89%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-99.08%)
Streamxkafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (-99.09%)
Kafka Streamsequivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (-94.19%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-99.82%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-98.97%)
WhatsmarsJava生态研究(Spring Boot + Redis + Dubbo + RocketMQ + Elasticsearch)🔥🔥🔥🔥🔥
Stars: ✭ 1,389 (-86.83%)
Rsysloga Rocket-fast SYStem for LOG processing
Stars: ✭ 1,385 (-86.86%)
AmbariMirror of Apache Ambari
Stars: ✭ 1,576 (-85.05%)
Springboot Labs一个涵盖六个专栏:Spring Boot 2.X、Spring Cloud、Spring Cloud Alibaba、Dubbo、分布式消息队列、分布式事务的仓库。希望胖友小手一抖,右上角来个 Star,感恩 1024
Stars: ✭ 12,804 (+21.43%)
Java Kafka ClientOpenTracing Instrumentation for Apache Kafka Client
Stars: ✭ 101 (-99.04%)
Seldon ServerMachine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (-86.39%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (-99.06%)
StormkafkamonDumps state of Storm Kafka consumers
Stars: ✭ 99 (-99.06%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (-98.92%)
Ultimate GoThis repo contains my notes on working with Go and computer systems.
Stars: ✭ 1,530 (-85.49%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-98.96%)
BrighterCommand Dispatcher, Processor, and Distributed Task Queue
Stars: ✭ 1,393 (-86.79%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-98.93%)
KukulcanA REPL for Apache Kafka
Stars: ✭ 103 (-99.02%)
Go Kafka ExampleGolang Kafka consumer and producer example
Stars: ✭ 108 (-98.98%)
Kafka Connectequivalent to kafka-connect 🔧 for nodejs ✨🐢🚀✨
Stars: ✭ 102 (-99.03%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-98.91%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-99.04%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+7.91%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-99.05%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (-85.36%)
Springboot Templatesspringboot和dubbo、netty的集成,redis mongodb的nosql模板, kafka rocketmq rabbit的MQ模板, solr solrcloud elasticsearch查询引擎
Stars: ✭ 100 (-99.05%)
Syslog Ngsyslog-ng is an enhanced log daemon, supporting a wide range of input and output methods: syslog, unstructured text, queueing, SQL & NoSQL.
Stars: ✭ 1,555 (-85.25%)
KuduMirror of Apache Kudu
Stars: ✭ 1,360 (-87.1%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-98.99%)
Kafka VisualizerA web client for visualizing your Apache Kafka topics live.
Stars: ✭ 98 (-99.07%)
OrcAn ORC file format reader and writer for Go.
Stars: ✭ 97 (-99.08%)
Kkbinlog支持mysql、MongoDB数据变更订阅分发
Stars: ✭ 112 (-98.94%)
Ksql PythonA python wrapper for the KSQL REST API.
Stars: ✭ 98 (-99.07%)
KaffeAn opinionated Elixir wrapper around brod, the Erlang Kafka client, that supports encrypted connections to Heroku Kafka out of the box.
Stars: ✭ 106 (-98.99%)
WillaA Clojure DSL for Kafka Streams
Stars: ✭ 97 (-99.08%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-85.67%)
MythReliable messages resolve distributed transactions
Stars: ✭ 1,470 (-86.06%)
SupermanSuperman是什么:构建Java 高级开发技术的知识体系,从基础不断打怪升级成为超人之路(更新中.......)
Stars: ✭ 106 (-98.99%)
Kafka Phpkafka php client
Stars: ✭ 1,340 (-87.29%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-87.31%)
IlluminatiThis is a Platform that collects all the data accuring in your Application and shows the data in real time by using Kibana or other tools.
Stars: ✭ 106 (-98.99%)
TreevizTree diagrams with JavaScript 🌲 📈
Stars: ✭ 95 (-99.1%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-99.13%)