EtlboxA lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
BenderBender - Serverless ETL Framework
LogstashLogstash - transport and process your logs, events, or other data
HydrographA visual ETL development and debugging tool for big data
Openkettlewebui一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
WaterdropProduction Ready Data Integration Product, documentation:
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
KgtkKnowledge Graph Toolkit
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Pyetlpython ETL framework
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
EtlalchemyExtract, Transform, Load: Any SQL Database in 4 lines of Code.
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
NofloFlow-based programming for JavaScript
qweryA SQL-like language for performing ETL transformations.
hotsubCommand line tool to run batch jobs concurrently with ETL framework on AWS or other cloud computing resources
ETL-Starter-Kit📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
OpenKettleWebUI一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
BETL-oldBETL. Meta data driven ETL generation using T-SQL
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.