Transformalize
Configurable Extract, Transform, and Load
Stars: ✭ 125 (+635.29%)
DIRECT
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (+17.65%)
vixtract
www.vixtract.ru
Stars: ✭ 40 (+135.29%)
SSISRabbitMQ
Custom SSIS Components for RabbitMQ
Stars: ✭ 37 (+117.65%)
Etlalchemy
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+2605.88%)
csv-cruncher
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (+88.24%)
cubetl
CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (+23.53%)
DataBridge.NET
Configurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-5.88%)
Dataform
Dataform is a framework for managing SQL-based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+1911.76%)
FlowMaster
ETL flow framework based on YAML configs in Python
Stars: ✭ 19 (+11.76%)
Ananas Desktop
A hackable data integration & analysis tool to enable non-technical users to edit data processing jobs and visualise data on demand.
Stars: ✭ 551 (+3141.18%)
Etl.net
Mass data processing with a complete ETL for .NET developers
Stars: ✭ 129 (+658.82%)
neo4j-jdbc
JDBC driver for Neo4j
Stars: ✭ 110 (+547.06%)
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has a complete ETL pipeline for a data lake: SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+129.41%)
metadata
Generates metadata management tables for Oracle, MySQL, and SQL Server
Stars: ✭ 45 (+164.71%)
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various tasks and jobs on GCP and AWS.
Stars: ✭ 38 (+123.53%)
link-move
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (+88.24%)
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+2023.53%)
Stetl
Stetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (+276.47%)
SQLServerTools
This repo is the home of various SQL-Server-Tools
Stars: ✭ 28 (+64.71%)
Hale
(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (+394.12%)
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (+747.06%)
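Openkettlewebui
A kettle-based web platform for scheduling and controlling data processing. It supports file repositories and database repositories, controls kettle data transformations through the web platform, and can be integrated into existing systems as middleware.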
Stars: ✭ 125 (+635.29%)
Metl
mito ETL tool
Stars: ✭ 153 (+800%)
Bender
Bender - Serverless ETL Framework
Stars: ✭ 171 (+905.88%)
Getting Started
This repository is a getting started guide to Singer.
Stars: ✭ 734 (+4217.65%)
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (+641.18%)
Pyetl
Python ETL framework
Stars: ✭ 33 (+94.12%)
SSISMHash
SSIS Multiple Hash makes it possible to generate many hash values from each input row. Supported hashes include MD5 and SHA1.
Stars: ✭ 32 (+88.24%)
tsql-scripts
Transact-SQL scripts and gists
Stars: ✭ 35 (+105.88%)
YelpDatasetSQL
Working with the Yelp Dataset in Azure SQL and SQL Server
Stars: ✭ 16 (-5.88%)
dswarm
An open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+235.29%)
TEAM
The Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.
Stars: ✭ 27 (+58.82%)
hamilton
A scalable general-purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, Python objects, ML models, etc.
Stars: ✭ 612 (+3500%)
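OpenKettleWebUI
A kettle-based web platform for scheduling and controlling data processing. It supports file repositories and database repositories, controls kettle data transformations through the web platform, and can be integrated into existing systems as middleware.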
Stars: ✭ 138 (+711.76%)
DaFlow
Apache Spark-based Data Flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
Stars: ✭ 24 (+41.18%)
csvplus
csvplus extends the standard Go encoding/csv package with a fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+294.12%)
qwery
A SQL-like language for performing ETL transformations.
Stars: ✭ 28 (+64.71%)
Choetl
ETL Framework for .NET / C# (Parser / Writer for CSV, Flat, XML, JSON, Key-Value, Parquet, YAML, Avro formatted files)
Stars: ✭ 372 (+2088.24%)
Etlbox
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+1094.12%)
NBi
NBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an XML syntax. With NBi, you don't need to develop C# or Java code to specify your tests! Nor do you need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (+500%)
sp who3
The sp_who3 stored procedure is a custom, open-source alternative to the sp_who system stored procedures available in SQL Server.
Stars: ✭ 49 (+188.24%)
dtd2mysql
MySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (+47.06%)
BlockHashLoc
Recover files using lists of block hashes, bypassing the file system entirely
Stars: ✭ 45 (+164.71%)
scif
Scientific filesystem: a filesystem organization for scientific software and metadata
Stars: ✭ 30 (+76.47%)
scihub
Copernicus Sentinel Science Hub rolling archive downloader
Stars: ✭ 28 (+64.71%)
django-music-publisher
Software for managing music metadata, registration/licencing of musical works and royalty processing.
Stars: ✭ 46 (+170.59%)
metabadger
Prevent SSRF attacks on AWS EC2 via automated upgrades to the more secure Instance Metadata Service v2 (IMDSv2).
Stars: ✭ 123 (+623.53%)
riscv-meta
RISC-V Instruction Set Metadata
Stars: ✭ 33 (+94.12%)
soddi
StackOverflow Data Dump Importer. Forked from https://bitbucket.org/bitpusher/soddi/ after the original author passed away.
Stars: ✭ 74 (+335.29%)
Common
SQL FineBuild provides 1-click install and best-practice configuration on Windows of SQL Server 2019 through to SQL Server 2005
Stars: ✭ 32 (+88.24%)
dask-sql
Distributed SQL Engine in Python using Dask
Stars: ✭ 271 (+1494.12%)
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a Python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+347.06%)
migrations
Migrations is a database migration tool that uses Go's database/sql from the standard library
Stars: ✭ 17 (+0%)
wikirepo
Python-based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (+94.12%)