DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-85.51%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-87.68%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-71.74%)
vixtractwww.vixtract.ru
Stars: ✭ 40 (-71.01%)
Pyetlpython ETL framework
Stars: ✭ 33 (-76.09%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (+4.35%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+161.59%)
Getting StartedThis repository is a getting started guide to Singer.
Stars: ✭ 734 (+431.88%)
EtlalchemyExtract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+233.33%)
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-72.46%)
Metlmito ETL tool
Stars: ✭ 153 (+10.87%)
qweryA SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-79.71%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-39.13%)
Openkettlewebui一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 125 (-9.42%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+343.48%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-88.41%)
EtlboxA lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+47.1%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-51.45%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-82.61%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-84.78%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+169.57%)
TransformalizeConfigurable Extract, Transform, and Load
Stars: ✭ 125 (-9.42%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (-8.7%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-53.62%)
BenderBender - Serverless ETL Framework
Stars: ✭ 171 (+23.91%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-76.81%)
dtd2mysqlMySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (-81.88%)
django-data-migrationData migration framework for Django that migrates legacy data into your new django app
Stars: ✭ 18 (-86.96%)
wikirepoPython based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-76.09%)
etl[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+102.17%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-58.7%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-44.2%)
DataX-srcDataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (-84.78%)
chronicle-etl📜 A CLI toolkit for extracting and working with your digital history
Stars: ✭ 78 (-43.48%)
kettleJava调用Kettle API执行转换和作业,Java代码生成Kettle转换。
Stars: ✭ 21 (-84.78%)
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-65.94%)
starlakeStarlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-88.41%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+58.7%)
krawlerA minimalist (geospatial) ETL
Stars: ✭ 51 (-63.04%)
google-sheets-etlLive import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-89.13%)
NBiNBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (-26.09%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-61.59%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (-85.51%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (-60.14%)
python mozetlETL jobs for Firefox Telemetry
Stars: ✭ 25 (-81.88%)
django-calaccess-raw-dataA Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (-55.8%)
YaEtlYet Another ETL in PHP
Stars: ✭ 60 (-56.52%)
id3cData logistics system enabling real-time pathogen surveillance. Built for the Seattle Flu Study.
Stars: ✭ 21 (-84.78%)
thainThain is a distributed flow schedule platform.
Stars: ✭ 81 (-41.3%)
zdh server数据采集平台zdh,etl 处理服务
Stars: ✭ 53 (-61.59%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+77.54%)
Example Airflow DagsExample DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+76.09%)
proc-thatproc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (-81.88%)