link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-84.24%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-89.66%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-80.79%)
BenderBender - Serverless ETL Framework
Stars: ✭ 171 (-15.76%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-68.47%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-29.06%)
vixtractwww.vixtract.ru
Stars: ✭ 40 (-80.3%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+201.48%)
TransformalizeConfigurable Extract, Transform, and Load
Stars: ✭ 125 (-38.42%)
OpenKettleWebUI一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 138 (-32.02%)
Metlmito ETL tool
Stars: ✭ 153 (-24.63%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+83.25%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-92.12%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+77.83%)
Getting StartedThis repository is a getting started guide to Singer.
Stars: ✭ 734 (+261.58%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-58.62%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-88.18%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-67%)
qweryA SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-86.21%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (-37.93%)
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-90.15%)
Openkettlewebui一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 125 (-38.42%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-91.63%)
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-81.28%)
EtlalchemyExtract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+126.6%)
Pyetlpython ETL framework
Stars: ✭ 33 (-83.74%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+814.29%)
Aws Ecs AirflowRun Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-47.29%)
Omniparseromniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (-27.09%)
Kafka Connectequivalent to kafka-connect 🔧 for nodejs ✨🐢🚀✨
Stars: ✭ 102 (-49.75%)
Csv2dbThe CSV to database command line loader
Stars: ✭ 102 (-49.75%)
OdČeská otevřená data
Stars: ✭ 99 (-51.23%)
Open Data Etl Utility KitUse Pentaho's open source data integration tool (Kettle) to create Extract-Transform-Load (ETL) processes to update a Socrata open data portal. Documentation is available at http://open-data-etl-utility-kit.readthedocs.io/en/stable
Stars: ✭ 93 (-54.19%)
MetlMetl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Stars: ✭ 185 (-8.87%)
LogstashLogstash - transport and process your logs, events, or other data
Stars: ✭ 12,543 (+6078.82%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-31.03%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+806.9%)
EtlLinkedPipes ETL is an RDF based, lightweight ETL tool
Stars: ✭ 88 (-56.65%)
Dig Etl EngineDownload DIG to run on your laptop or server.
Stars: ✭ 81 (-60.1%)
Linq2dbLinq to database provider.
Stars: ✭ 2,211 (+989.16%)
Kettle Web基于spring boot通过java代码调用kette
Stars: ✭ 128 (-36.95%)
KgtkKnowledge Graph Toolkit
Stars: ✭ 81 (-60.1%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-61.08%)
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (-36.45%)
SaynData processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-61.08%)
Data StoryA visual process builder for Laravel
Stars: ✭ 71 (-65.02%)
ExtractA cross-platform command line tool for parallelised content extraction and analysis.
Stars: ✭ 188 (-7.39%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+2323.15%)
Etl.netMass processing data with a complete ETL for .net developers
Stars: ✭ 129 (-36.45%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+488.67%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-64.04%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-64.53%)
Usaspending ApiServer application to serve U.S. federal spending data via a RESTful API
Stars: ✭ 166 (-18.23%)
TransporterSync data between persistence engines, like ETL only not stodgy
Stars: ✭ 1,175 (+478.82%)
GlobalbioticinteractionsGlobal Biotic Interactions provides access to existing species interaction datasets
Stars: ✭ 71 (-65.02%)