Kafka Streamsequivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (+3305.56%)
Storm Dynamic SpoutA framework for building spouts for Apache Storm and a Kafka based spout for dynamically skipping messages to be processed later.
Stars: ✭ 40 (+122.22%)
HazelcastOpen-source distributed computation and storage platform
Stars: ✭ 4,662 (+25800%)
StormMirror of Apache Storm
Stars: ✭ 6,297 (+34883.33%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (+4650%)
SmooksAn extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (+1527.78%)
talariaTalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (+722.22%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (+438.89%)
rastercuberastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-16.67%)
CS Book🔥 Latest computer science e-books。提供最新技术类电子书下载, “我无非就是想卷死各位,或者被各位卷死!”
Stars: ✭ 40 (+122.22%)
Big-Data-Demo基于Vue、three.js、echarts,数据可视化展示项目,包含三维模型导入交互、三维模型标注等功能
Stars: ✭ 146 (+711.11%)
kafka-shell⚡A supercharged, interactive Kafka shell built on top of the existing Kafka CLI tools.
Stars: ✭ 107 (+494.44%)
xxhadoopData Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (+105.56%)
spark-recordsBulletproof Apache Spark jobs with fast root cause analysis of failures.
Stars: ✭ 67 (+272.22%)
ramenA stream processing language and compiler for small-scale monitoring
Stars: ✭ 14 (-22.22%)
go-riversCollection of stream processing / multiplexing / networking libs in Go
Stars: ✭ 35 (+94.44%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+216.67%)
artmlARTML- Real time learning
Stars: ✭ 20 (+11.11%)
gretel-python-clientThe Gretel Python Client allows you to interact with the Gretel REST API.
Stars: ✭ 28 (+55.56%)
SGDLibraryMATLAB/Octave library for stochastic optimization algorithms: Version 1.0.20
Stars: ✭ 165 (+816.67%)
RemoteShuffleServiceCeleborn provides an elastic and high-performance service for shuffle and spilled data.
Stars: ✭ 262 (+1355.56%)
siembolAn open-source, real-time Security Information & Event Management tool based on big data technologies, providing a scalable, advanced security analytics framework.
Stars: ✭ 153 (+750%)
xcastA High-Performance Data Science Toolkit for the Earth Sciences
Stars: ✭ 28 (+55.56%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+272.22%)
ByteSlice"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (+33.33%)
beam-siteApache Beam Site
Stars: ✭ 28 (+55.56%)
mageMAGE - Memgraph Advanced Graph Extensions 🔮
Stars: ✭ 89 (+394.44%)
product-spAn open source, cloud-native streaming data integration and analytics product optimized for agile digital businesses
Stars: ✭ 80 (+344.44%)
arrow-datafusionApache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+13011.11%)
crawling-frameworkEasily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (+22.22%)
scarfToolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+200%)
LoL-Match-PredictionWin probability predictions for League of Legends matches using neural networks
Stars: ✭ 34 (+88.89%)
bullet-stormThe Apache Storm implementation of the Bullet backend
Stars: ✭ 39 (+116.67%)
IoT-system-PLC-data-to-InfluxDBThis project aim is to provide free software to fetch data from plcs (Siemens S7-300/400/1200/1500) and store it. Used stack is completly opensource. I used InfluDB as data storage, so application principle is following Big Data paradigm.
Stars: ✭ 26 (+44.44%)
rippleSimple shared surface streaming application
Stars: ✭ 17 (-5.56%)
spark-rootApache Spark Data Source for ROOT File Format
Stars: ✭ 28 (+55.56%)
cloudberryBig Data Visualization
Stars: ✭ 89 (+394.44%)
dxramA distributed in-memory key-value storage for billions of small objects.
Stars: ✭ 25 (+38.89%)
nebulaA distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+45433.33%)
incubator-liminalApache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+550%)
img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Stars: ✭ 1,173 (+6416.67%)
godsendA simple and eloquent workflow for streaming messages to micro-services.
Stars: ✭ 15 (-16.67%)
awesome-bigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 11,093 (+61527.78%)
stormnodeNode js node client for storm.dev
Stars: ✭ 11 (-38.89%)
GDLibraryMatlab library for gradient descent algorithms: Version 1.0.1
Stars: ✭ 50 (+177.78%)
kafka-workersKafka Workers is a client library which unifies records consuming from Kafka and processing them by user-defined WorkerTasks.
Stars: ✭ 30 (+66.67%)
lcbo-apiA crawler and API server for Liquor Control Board of Ontario retail data
Stars: ✭ 152 (+744.44%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+116.67%)