Go StreamsA lightweight stream processing library for Go
Stars: ✭ 615 (+109.9%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+23.21%)
Argo EventsEvent-driven workflow automation framework
Stars: ✭ 821 (+180.2%)
Kafka Streamsequivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (+109.22%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (+191.81%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-71.33%)
PsiPlatform for Situated Intelligence
Stars: ✭ 249 (-15.02%)
HazelcastOpen-source distributed computation and storage platform
Stars: ✭ 4,662 (+1491.13%)
Omniparseromniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (-49.49%)
bandar-logMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (-93.17%)
Fluentmediator🔀 FluentMediator is an unobtrusive library that allows developers to build custom pipelines for Commands, Queries and Events.
Stars: ✭ 128 (-56.31%)
RikoA Python stream processing engine modeled after Yahoo! Pipes
Stars: ✭ 1,571 (+436.18%)
Tuna🐟 A streaming ETL for fish
Stars: ✭ 11 (-96.25%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-73.04%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-52.22%)
BenthosFancy stream processing made operationally mundane
Stars: ✭ 3,705 (+1164.51%)
football-eventsEvent-Driven microservices with Kafka Streams
Stars: ✭ 57 (-80.55%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+1578.84%)
talariaTalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (-49.49%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-50.85%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-19.8%)
StroomStroom is a highly scalable data storage, processing and analysis platform.
Stars: ✭ 344 (+17.41%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-77.13%)
storm-mlan online learning algorithm library for Storm
Stars: ✭ 18 (-93.86%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-80.55%)
LograngeHigh performance data aggregating storage
Stars: ✭ 181 (-38.23%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-86.69%)
dspatchThe Refreshingly Simple Cross-Platform C++ Dataflow / Pipelining / Stream Processing / Reactive Programming Framework
Stars: ✭ 124 (-57.68%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (-16.38%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-66.89%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+26.96%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-93.52%)
WatermillBuilding event-driven applications the easy way in Go.
Stars: ✭ 3,504 (+1095.9%)
EsperIoTSmall and simple stream-based CEP tool for IoT devices connected to an MQTT broker
Stars: ✭ 18 (-93.86%)
vxqueryMirror of Apache VXQuery
Stars: ✭ 19 (-93.52%)
DatavecETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (-7.17%)
DatahubThe Metadata Platform for the Modern Data Stack
Stars: ✭ 4,232 (+1344.37%)
ShapeofviewGive a custom shape to any android view, Material Design 2 ready
Stars: ✭ 2,977 (+916.04%)
KedaKEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes
Stars: ✭ 4,015 (+1270.31%)
TriggersEvent triggering with Tekton!
Stars: ✭ 279 (-4.78%)
DeckSlide Decks
Stars: ✭ 261 (-10.92%)
TableexporttableExport(table导出文件,支持json、csv、txt、xml、word、excel、image、pdf)
Stars: ✭ 261 (-10.92%)
Dita OtDITA Open Toolkit — the open-source XML publishing engine for content authored in the Darwin Information Typing Architecture.
Stars: ✭ 279 (-4.78%)
XreaderXML, NEWS, RSS & Scrapping Reader maked in Xamarin, for educational purpose.
Stars: ✭ 259 (-11.6%)
PolyaxonMachine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+912.29%)
CrateCrateDB is a distributed SQL database that makes it simple to store and analyze
massive amounts of data in real-time.
Stars: ✭ 3,254 (+1010.58%)
Htmlparser2The fast & forgiving HTML and XML parser
Stars: ✭ 3,299 (+1025.94%)
Php Curl ClassPHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+890.78%)
ServerlessbydesignA visual approach to serverless development. Think. Build. Repeat.
Stars: ✭ 254 (-13.31%)
SuccinctEnabling queries on compressed data.
Stars: ✭ 257 (-12.29%)
SubstanceA JavaScript library for web-based content editing.
Stars: ✭ 2,737 (+834.13%)
TreescaleEvent/Data distribution system without any configuration, but with data delivery guarantees
Stars: ✭ 286 (-2.39%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1463.48%)
etl managerA python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-95.22%)
keralaDistributed KV Streams
Stars: ✭ 16 (-94.54%)
Pubmed parser📋 A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
Stars: ✭ 274 (-6.48%)