All Projects → Smooks → Similar Projects or Alternatives

1318 Open source projects that are alternatives of or similar to Smooks

Go Streams
A lightweight stream processing library for Go
Stars: ✭ 615 (+109.9%)
Mutual labels:  etl, stream-processing, pipelines
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+23.21%)
Mutual labels:  big-data, etl
Argo Events
Event-driven workflow automation framework
Stars: ✭ 821 (+180.2%)
Mutual labels:  event-driven, pipelines
Kafka Streams
equivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (+109.22%)
Mutual labels:  big-data, stream-processing
Hazelcast Jet
Distributed Stream and Batch Processing
Stars: ✭ 855 (+191.81%)
Mutual labels:  big-data, stream-processing
Hale
(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-71.33%)
Mutual labels:  xml, etl
Psi
Platform for Situated Intelligence
Stars: ✭ 249 (-15.02%)
Mutual labels:  stream-processing, pipelines
Hazelcast
Open-source distributed computation and storage platform
Stars: ✭ 4,662 (+1491.13%)
Mutual labels:  big-data, stream-processing
Omniparser
omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (-49.49%)
Mutual labels:  xml, etl
bandar-log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (-93.17%)
Mutual labels:  big-data, etl
Fluentmediator
🔀 FluentMediator is an unobtrusive library that allows developers to build custom pipelines for Commands, Queries and Events.
Stars: ✭ 128 (-56.31%)
Mutual labels:  event-driven, pipelines
Riko
A Python stream processing engine modeled after Yahoo! Pipes
Stars: ✭ 1,571 (+436.18%)
Mutual labels:  etl, stream-processing
Tuna
🐟 A streaming ETL for fish
Stars: ✭ 11 (-96.25%)
Mutual labels:  etl, stream-processing
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-73.04%)
Mutual labels:  big-data, etl
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (-52.22%)
Mutual labels:  big-data, etl
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+1164.51%)
Mutual labels:  etl, stream-processing
football-events
Event-Driven microservices with Kafka Streams
Stars: ✭ 57 (-80.55%)
Mutual labels:  stream-processing, event-driven
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+1578.84%)
Mutual labels:  etl, pipelines
talaria
TalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (-49.49%)
Mutual labels:  big-data, stream-processing
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (-50.85%)
Mutual labels:  big-data, etl
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-19.8%)
Mutual labels:  big-data, etl
Marklogic Data Hub
The MarkLogic Data Hub: documentation ==>
Stars: ✭ 113 (-61.43%)
Mutual labels:  xml, etl
Stroom
Stroom is a highly scalable data storage, processing and analysis platform.
Stars: ✭ 344 (+17.41%)
Mutual labels:  xml, big-data
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-94.2%)
Mutual labels:  big-data, etl
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-77.13%)
Mutual labels:  etl, stream-processing
storm-ml
an online learning algorithm library for Storm
Stars: ✭ 18 (-93.86%)
Mutual labels:  big-data, stream-processing
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-80.55%)
Mutual labels:  etl, stream-processing
Logrange
High performance data aggregating storage
Stars: ✭ 181 (-38.23%)
Mutual labels:  stream-processing, pipelines
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-86.69%)
Mutual labels:  big-data, etl
dspatch
The Refreshingly Simple Cross-Platform C++ Dataflow / Pipelining / Stream Processing / Reactive Programming Framework
Stars: ✭ 124 (-57.68%)
Mutual labels:  pipelines, stream-processing
Aws Etl Orchestrator
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (-16.38%)
Mutual labels:  big-data, etl
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-66.89%)
Mutual labels:  big-data, stream-processing
Choetl
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+26.96%)
Mutual labels:  xml, etl
Bandar Log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-93.52%)
Mutual labels:  big-data, etl
Watermill
Building event-driven applications the easy way in Go.
Stars: ✭ 3,504 (+1095.9%)
Mutual labels:  event-driven, stream-processing
EsperIoT
Small and simple stream-based CEP tool for IoT devices connected to an MQTT broker
Stars: ✭ 18 (-93.86%)
Mutual labels:  stream-processing, event-driven
vxquery
Mirror of Apache VXQuery
Stars: ✭ 19 (-93.52%)
Mutual labels:  big-data, xml
Datavec
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (-7.17%)
Mutual labels:  etl
Waterfall Toolbar
Stars: ✭ 282 (-3.75%)
Mutual labels:  xml
Datahub
The Metadata Platform for the Modern Data Stack
Stars: ✭ 4,232 (+1344.37%)
Mutual labels:  big-data
Shapeofview
Give a custom shape to any android view, Material Design 2 ready
Stars: ✭ 2,977 (+916.04%)
Mutual labels:  xml
Keda
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes
Stars: ✭ 4,015 (+1270.31%)
Mutual labels:  event-driven
Triggers
Event triggering with Tekton!
Stars: ✭ 279 (-4.78%)
Mutual labels:  pipelines
Deck
Slide Decks
Stars: ✭ 261 (-10.92%)
Mutual labels:  xml
Tableexport
tableExport(table导出文件,支持json、csv、txt、xml、word、excel、image、pdf)
Stars: ✭ 261 (-10.92%)
Mutual labels:  xml
Dita Ot
DITA Open Toolkit — the open-source XML publishing engine for content authored in the Darwin Information Typing Architecture.
Stars: ✭ 279 (-4.78%)
Mutual labels:  xml
Xreader
XML, NEWS, RSS & Scrapping Reader maked in Xamarin, for educational purpose.
Stars: ✭ 259 (-11.6%)
Mutual labels:  xml
Polyaxon
Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+912.29%)
Mutual labels:  pipelines
Crate
CrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of data in real-time.
Stars: ✭ 3,254 (+1010.58%)
Mutual labels:  big-data
Htmlparser2
The fast & forgiving HTML and XML parser
Stars: ✭ 3,299 (+1025.94%)
Mutual labels:  xml
Sweet xml
Stars: ✭ 279 (-4.78%)
Mutual labels:  xml
Php Curl Class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+890.78%)
Mutual labels:  xml
Serverlessbydesign
A visual approach to serverless development. Think. Build. Repeat.
Stars: ✭ 254 (-13.31%)
Mutual labels:  event-driven
Succinct
Enabling queries on compressed data.
Stars: ✭ 257 (-12.29%)
Mutual labels:  big-data
Substance
A JavaScript library for web-based content editing.
Stars: ✭ 2,737 (+834.13%)
Mutual labels:  xml
Treescale
Event/Data distribution system without any configuration, but with data delivery guarantees
Stars: ✭ 286 (-2.39%)
Mutual labels:  event-driven
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1463.48%)
Mutual labels:  big-data
etl manager
A python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-95.22%)
Mutual labels:  etl
kerala
Distributed KV Streams
Stars: ✭ 16 (-94.54%)
Mutual labels:  stream-processing
Pubmed parser
📋 A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
Stars: ✭ 274 (-6.48%)
Mutual labels:  xml
1-60 of 1318 similar projects