DaFlow
An Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
Stars: ✭ 24 (+14.29%)
DIRECT
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-4.76%)
vixtract
www.vixtract.ru
Stars: ✭ 40 (+90.48%)
csvplus
csvplus extends the standard Go encoding/csv package with a fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+219.05%)
Pyetl
Python ETL framework
Stars: ✭ 33 (+57.14%)
qwery
A SQL-like language for performing ETL transformations.
Stars: ✭ 28 (+33.33%)
cubetl
CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (+0%)
BETL-old
BETL. Metadata-driven ETL generation using T-SQL
Stars: ✭ 17 (-19.05%)
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+2814.29%)
Choetl
ETL framework for .NET / C# (parser / writer for CSV, flat, XML, JSON, key-value, Parquet, YAML, and Avro formatted files)
Stars: ✭ 372 (+1671.43%)
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake, with SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations.
Stars: ✭ 39 (+85.71%)
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various tasks and jobs on GCP and AWS.
Stars: ✭ 38 (+80.95%)
Debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Stars: ✭ 5,937 (+28171.43%)
Kafdrop
Kafka Web UI
Stars: ✭ 3,158 (+14938.1%)
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+17542.86%)
Stetl
Stetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (+204.76%)
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+1619.05%)
Hale
(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (+300%)
incubator-eventmesh
EventMesh is a dynamic event-driven application runtime used to decouple the application and backend middleware layer, which supports a wide range of use cases that encompass complex multi-cloud, widely distributed topologies using diverse technology stacks.
Stars: ✭ 939 (+4371.43%)
Bender
Bender - Serverless ETL Framework
Stars: ✭ 171 (+714.29%)
Etlbox
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+866.67%)
DataBridge.NET
Configurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-23.81%)
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (+42.86%)
Transformalize
Configurable Extract, Transform, and Load
Stars: ✭ 125 (+495.24%)
pg2k4j
PostgreSQL to Kinesis for Java
Stars: ✭ 69 (+228.57%)
Spreplicator
♻ Replicates SharePoint Lists
Stars: ✭ 22 (+4.76%)
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+8738.1%)
kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (+28.57%)
pgcapture
A scalable Netflix DBLog implementation for PostgreSQL
Stars: ✭ 94 (+347.62%)
Getting Started
This repository is a getting started guide to Singer.
Stars: ✭ 734 (+3395.24%)
Etlalchemy
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+2090.48%)
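Openkettlewebui
A web-based scheduling and control platform for data processing built on Kettle. It supports both file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.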
Stars: ✭ 125 (+495.24%)
Etl.net
Mass data processing with a complete ETL for .NET developers
Stars: ✭ 129 (+514.29%)
Metl
mito ETL tool
Stars: ✭ 153 (+628.57%)
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (+585.71%)
MySqlCdc
MySQL/MariaDB binlog replication client for .NET
Stars: ✭ 71 (+238.1%)
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (+500%)
Pglogical
Logical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
Stars: ✭ 455 (+2066.67%)
walrus
Applying RLS to PostgreSQL WAL
Stars: ✭ 59 (+180.95%)
Omniparser
omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (+604.76%)
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-4.76%)
Csv2db
The CSV to database command line loader
Stars: ✭ 102 (+385.71%)
Etl with python
ETL with Python - Taught at DWH course 2017 (TAU)
Stars: ✭ 68 (+223.81%)
link-move
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (+52.38%)
FlowMaster
ETL flow framework based on YAML configs in Python
Stars: ✭ 19 (-9.52%)
Ether sql
A Python library to push Ethereum blockchain data into an SQL database.
Stars: ✭ 41 (+95.24%)
commander
Build event-driven and event streaming applications with ease
Stars: ✭ 60 (+185.71%)
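OpenKettleWebUI
A web-based scheduling and control platform for data processing built on Kettle. It supports both file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.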
Stars: ✭ 138 (+557.14%)
dogETL
A lib to transform data from JDBC, CSV, and JSON to each other.
Stars: ✭ 15 (-28.57%)
Ethereum Etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+4452.38%)
csv-cruncher
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (+52.38%)
dswarm
An open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+171.43%)
pg keeper
Simplified clustering module for PostgreSQL
Stars: ✭ 32 (+52.38%)
flighthub
Flight ticket booking system implemented with CQRS and ES.
Stars: ✭ 26 (+23.81%)
invoices-cli
Generates HTML and PDF invoices using HTML template files and CSV databases for products, clients, and transactions
Stars: ✭ 34 (+61.9%)
stock-market-scraper
Scrapes historical stock market data from Yahoo Finance (https://finance.yahoo.com/)
Stars: ✭ 110 (+423.81%)
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (+109.52%)