Miller: Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience to help with creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Shioaji: Shioaji, an all-new cross-platform API for trading (cross-platform securities trading API)
Smart open: Utils for streaming large files (S3, HDFS, gzip, bz2...)
Rangeless: C++ LINQ-like library of higher-order functions for data manipulation
River: 🌊 Online machine learning in Python
Awesome Bigdata: A curated list of awesome big data frameworks, resources and other awesomeness.
Gsf: Grid Solutions Framework
Toolbox: A Java Toolbox for Scalable Probabilistic Machine Learning
Pravega: Pravega is 100% open source and community-driven. All components are available under the Apache 2 License on GitHub.
Tractor: structured concurrent, Python parallelism
Pysad: Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)
Optbinning: Optimal binning (monotonic binning with constraints). Supports batch and stream optimal binning
Machine: Machine is a workflow/pipeline library for processing data
Trill: Trill is a single-node query processor for temporal or streaming data.
Nsdb: Natural Series Database
Saber: Window-Based Hybrid CPU/GPU Stream Processing Engine
Go Mesh: Realtime data exchange platform for Smart Cities
Streamz: Real-time stream processing for Python
Go Streams: A lightweight stream processing library for Go
Sparta: Real Time Analytics and Data Pipelines based on Spark Streaming
Scikit Multiflow: A machine learning package for streaming data in Python. The other ancestor of River.
Swim: Distributed software platform for building stateful, massively real-time streaming applications.
Rrcf: 🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Benthos: Fancy stream processing made operationally mundane
Cloudflow: Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
meros: 🪢 A fast utility that makes reading multipart responses simple
icicle: Icicle Streaming Query Language
transit: Massively real-time city transit streaming application
richflow: A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
godsend: A simple and eloquent workflow for streaming messages to micro-services.
cinje: A Pythonic and ultra fast template engine DSL.
awesome-bigdata: A curated list of awesome big data frameworks, resources and other awesomeness.
twitter-stream-api: 🐤 Another Twitter stream PHP library to retrieve filtered tweets on hot.
the-apache-ignite-book: All code samples, scripts and more in-depth examples for The Apache Ignite Book. Includes Apache Ignite 2.6 or above
mxfactorial: A payment application intended for deployment by the United States Treasury
openPDC: Open Source Phasor Data Concentrator
you-cant-download-this-image: Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD