kafka-shell⚡A supercharged, interactive Kafka shell built on top of the existing Kafka CLI tools.
Stars: ✭ 107 (+35.44%)
DparkPython clone of Spark, a MapReduce-like framework in Python
Stars: ✭ 2,668 (+3277.22%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+77.22%)
MnemonicApache Mnemonic - A non-volatile hybrid memory storage oriented library
Stars: ✭ 91 (+15.19%)
Awesome BigdataA curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+13163.29%)
HudiUpserts, Deletes And Incremental Processing on Big Data.
Stars: ✭ 2,586 (+3173.42%)
awesome-bigdataA curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 11,093 (+13941.77%)
GearpumpLightweight real-time big data streaming engine over Akka
Stars: ✭ 745 (+843.04%)
Openwhisk Runtime NodejsApache OpenWhisk Runtime NodeJS supports Apache OpenWhisk functions written in JavaScript for NodeJS
Stars: ✭ 43 (-45.57%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1148.1%)
ExamplesDemo applications and code examples for Confluent Platform and Apache Kafka
Stars: ✭ 571 (+622.78%)
Awesome SolrA curated list of Awesome Apache Solr links and resources.
Stars: ✭ 69 (-12.66%)
Storm Dynamic SpoutA framework for building spouts for Apache Storm and a Kafka based spout for dynamically skipping messages to be processed later.
Stars: ✭ 40 (-49.37%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-5.06%)
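Docs4devDocumentation for commonly used backend development frameworks, with Chinese translations. Covers the Spring family (Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session), big data (Apache Hive, HBase, Apache Flume), logging (Log4j2, Logback), HTTP servers (NGINX, Apache), Python, and databases (OpenTSDB, MySQL, PostgreSQL), providing the latest official documentation alongside the corresponding Chinese translations.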
Stars: ✭ 974 (+1132.91%)
Logging Log4j2Apache Log4j 2 is an upgrade to Log4j that provides significant improvements over its predecessor, Log4j 1.x, and provides many of the improvements available in Logback while fixing some inherent problems in Logback's architecture.
Stars: ✭ 1,133 (+1334.18%)
AutocrawlerGoogle, Naver multiprocess image web crawler (Selenium)
Stars: ✭ 957 (+1111.39%)
MachineMachine is a workflow/pipeline library for processing data
Stars: ✭ 78 (-1.27%)
Openwhisk CliApache OpenWhisk Command Line Interface (CLI)
Stars: ✭ 73 (-7.59%)
Pyspark ExamplesCode examples on Apache Spark using Python
Stars: ✭ 58 (-26.58%)
PantherDetect threats with log data and improve cloud security posture
Stars: ✭ 885 (+1020.25%)
TutorialsA project for developing tutorials for Streams
Stars: ✭ 14 (-82.28%)
Openwhisk ApigatewayApache OpenWhisk API Gateway service for exposing actions as REST interfaces.
Stars: ✭ 56 (-29.11%)
Streamsx.inetThis toolkit supports common internet protocols, such as HTTP and WebSockets
Stars: ✭ 11 (-86.08%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-12.66%)
StreamElegant stream processing pipeline written entirely in Golang
Stars: ✭ 45 (-43.04%)
Fs2 KafkaKafka client for Functional Streams for Scala (fs2)
Stars: ✭ 75 (-5.06%)
Spring Web Rss ChannelsA Full Stack RSS Reader web application built with Spring MVC and JSP. It uses libraries like Spring, JPA, Bootstrap, Apache Tiles, JSP etc. There is also a static code analysis tool called Checkstyle.
Stars: ✭ 40 (-49.37%)
BurrowuiThis is a NodeJS/Angular 2 frontend UI for Kafka cluster monitoring with Burrow
Stars: ✭ 69 (-12.66%)
Reddit sse streamA Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.
Stars: ✭ 39 (-50.63%)
Php Apache TikaApache Tika bindings for PHP: extract text and metadata from documents, images and other formats
Stars: ✭ 76 (-3.8%)
SaberWindow-Based Hybrid CPU/GPU Stream Processing Engine
Stars: ✭ 35 (-55.7%)
Fail2ban.webexploitsThis custom Fail2Ban filter and jail deals with scans for common WordPress, Joomla and other web exploits run by automated bots and by those seeking exploitable web sites.
Stars: ✭ 67 (-15.19%)
Awesome Cordova PluginsA curated list of awesome Cordova Apache Plugins https://cordova.apache.org/plugins/
Stars: ✭ 33 (-58.23%)
Apache Spark Hands OnEducational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-6.33%)
Streamsx.messagingThis toolkit is focused on interacting with popular messaging systems such as Kafka, JMS, XMS, and MQTT. After release v5.4.2 the complete toolkit will be deprecated. See the README.md file for hints to alternative toolkits.
Stars: ✭ 31 (-60.76%)
Pg2kafkaShip changes in Postgres 🐘 to Kafka 📖
Stars: ✭ 61 (-22.78%)
Aws Auto Terminate Idle EmrAWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS-based solution that uses AWS CloudWatch and an AWS Lambda Python script built on Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-73.42%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (+1.27%)
Awesome WicketA curated list of awesome projects powered by Apache Wicket
Stars: ✭ 56 (-29.11%)
SiddhiStream Processing and Complex Event Processing Engine
Stars: ✭ 1,185 (+1400%)
WormholeWormhole is a SPaaS (Stream Processing as a Service) Platform
Stars: ✭ 863 (+992.41%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-30.38%)
Tuna🐟 A streaming ETL for fish
Stars: ✭ 11 (-86.08%)
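Bigdata Interview🎯 🌟[Big data interview questions] A collection of big data interview questions gathered from around the web, together with the author's own summarized answers. Currently covers interview knowledge for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks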
Stars: ✭ 857 (+984.81%)
Poi Android📈 Apache POI for Android
Stars: ✭ 77 (-2.53%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (+982.28%)
Druid ExporterA Golang-based exporter that captures Druid API metrics and receives HTTP JSON data emitted by Druid.
Stars: ✭ 54 (-31.65%)
VectorA reliable, high-performance tool for building observability data pipelines.
Stars: ✭ 8,736 (+10958.23%)
AkarataIndonesian stemmer - a JavaScript library for extracting the root word from affixed words in Indonesian.
Stars: ✭ 26 (-67.09%)