BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-98.96%)
Flow PipelineA set of tools and examples to run a flow-pipeline (sFlow, NetFlow)
Stars: ✭ 86 (-99.18%)
BrighterCommand Dispatcher, Processor, and Distributed Task Queue
Stars: ✭ 1,393 (-86.79%)
Bigpipe以Kafka为存储介质,提供异步Http RPC的中间件
Stars: ✭ 84 (-99.2%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-98.93%)
Propulsion.NET event stream projection and scheduling platform with EventStore, CosmosDb, Equinox and Kafka integrations
Stars: ✭ 84 (-99.2%)
KukulcanA REPL for Apache Kafka
Stars: ✭ 103 (-99.02%)
Open Bank MarkA bank simulation application using mainly Clojure, which can be used to end-to-end test and show some graphs.
Stars: ✭ 81 (-99.23%)
Go Kafka ExampleGolang Kafka consumer and producer example
Stars: ✭ 108 (-98.98%)
KsppA high performance/ real-time C++ Kafka streams framework (C++17)
Stars: ✭ 80 (-99.24%)
Kafka Connectequivalent to kafka-connect 🔧 for nodejs ✨🐢🚀✨
Stars: ✭ 102 (-99.03%)
PanoptesA Global Scale Network Telemetry Ecosystem
Stars: ✭ 80 (-99.24%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-98.91%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-99.24%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-99.04%)
IotdbApache IoTDB
Stars: ✭ 1,221 (-88.42%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+7.91%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-99.25%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-99.05%)
Fs2 KafkaKafka client for functional streams for scala (fs2)
Stars: ✭ 75 (-99.29%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (-85.36%)
CookbookThe Data Engineering Cookbook
Stars: ✭ 9,829 (-6.78%)
Springboot Templatesspringboot和dubbo、netty的集成,redis mongodb的nosql模板, kafka rocketmq rabbit的MQ模板, solr solrcloud elasticsearch查询引擎
Stars: ✭ 100 (-99.05%)
BookkeeperApache Bookkeeper
Stars: ✭ 1,178 (-88.83%)
Syslog Ngsyslog-ng is an enhanced log daemon, supporting a wide range of input and output methods: syslog, unstructured text, queueing, SQL & NoSQL.
Stars: ✭ 1,555 (-85.25%)
Kafka Elasticsearch InjectorGolang app to read records from a set of kafka topics and write them to an elasticsearch cluster
Stars: ✭ 70 (-99.34%)
KuduMirror of Apache Kudu
Stars: ✭ 1,360 (-87.1%)
BurrowuiThis is a NodeJS/Angular 2 frontend UI for Kafka cluster monitoring with Burrow
Stars: ✭ 69 (-99.35%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-98.99%)
HydraA real-time data replication platform that "unbundles" the receiving, transforming, and transport of data streams.
Stars: ✭ 68 (-99.36%)
OrcAn ORC file format reader and writer for Go.
Stars: ✭ 97 (-99.08%)
Kkbinlog支持mysql、MongoDB数据变更订阅分发
Stars: ✭ 112 (-98.94%)
Flink ShadedApache Flink shaded artifacts repository
Stars: ✭ 67 (-99.36%)
Camel Kafka ConnectorCamel Kafka Connector allows you to use all Camel components as Kafka Connect connectors
Stars: ✭ 63 (-99.4%)
KryptoflowReal-time analysis of bitcoin markets with Kafka and Tensorflow Serving
Stars: ✭ 66 (-99.37%)
KaffeAn opinionated Elixir wrapper around brod, the Erlang Kafka client, that supports encrypted connections to Heroku Kafka out of the box.
Stars: ✭ 106 (-98.99%)
TreevizTree diagrams with JavaScript 🌲 📈
Stars: ✭ 95 (-99.1%)
Springwolf CoreAutomated documentation for async APIs built with Spring Boot
Stars: ✭ 63 (-99.4%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-85.67%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-87.31%)
WarpConvert and analyze large data sets at light speed, on Mac and iOS.
Stars: ✭ 62 (-99.41%)
IlluminatiThis is a Platform that collects all the data accuring in your Application and shows the data in real time by using Kibana or other tools.
Stars: ✭ 106 (-98.99%)
NabhashAn extremely fast Non-crypto-safe AES Based Hash algorithm for Big Data
Stars: ✭ 62 (-99.41%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-99.13%)
Ansible PlaybookAnsible playbook to deploy distributed technologies
Stars: ✭ 61 (-99.42%)
Pg2kafkaShip changes in Postgres 🐘 to Kafka 📖
Stars: ✭ 61 (-99.42%)
Schema RegistryConfluent Schema Registry for Kafka
Stars: ✭ 1,647 (-84.38%)
MeetupKafka 한국 사용자 모임에서 운영하는 meetup repository
Stars: ✭ 106 (-98.99%)
ReefMirror of Apache REEF
Stars: ✭ 92 (-99.13%)
KaramelKafka Browser that supports standalone Kafka and Strimzi operator
Stars: ✭ 61 (-99.42%)
VerticapyVerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-99.44%)
JulieA solution to help you build automation and gitops in your Apache Kafka deployments. The Kafka gitops!
Stars: ✭ 104 (-99.01%)