DIRECT
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-47.37%)
csvplus
csvplus extends the standard Go encoding/csv package with a fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+76.32%)
DaFlow
Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
Stars: ✭ 24 (-36.84%)
Ethereum Etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+2415.79%)
Bitcoin Etl
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (+357.89%)
vixtract
www.vixtract.ru
Stars: ✭ 40 (+5.26%)
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has a complete ETL pipeline for a data lake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations.
Stars: ✭ 39 (+2.63%)
hamilton
A scalable general-purpose micro-framework for defining dataflows. You can use it to create dataframes, NumPy matrices, Python objects, ML models, etc.
Stars: ✭ 612 (+1510.53%)
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+39.47%)
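OpenKettleWebUI
A Kettle-based web platform for scheduling and controlling data processing. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.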
Stars: ✭ 138 (+263.16%)
link-move
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-15.79%)
iris3
An upgraded and improved version of the Iris automatic GCP-labeling project
Stars: ✭ 38 (+0%)
hotsub
Command line tool to run batch jobs concurrently with ETL framework on AWS or other cloud computing resources
Stars: ✭ 29 (-23.68%)
cubetl
CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-44.74%)
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+850%)
Choetl
ETL Framework for .NET / C# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+878.95%)
Getting Started
This repository is a getting started guide to Singer.
Stars: ✭ 734 (+1831.58%)
Hale
(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (+121.05%)
DataBridge.NET
Configurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-57.89%)
Pyetl
Python ETL framework
Stars: ✭ 33 (-13.16%)
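Openkettlewebui
A Kettle-based web platform for scheduling and controlling data processing. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.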
Stars: ✭ 125 (+228.95%)
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (+231.58%)
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-57.89%)
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+107.89%)
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (-21.05%)
BETL-old
BETL. Metadata-driven ETL generation using T-SQL
Stars: ✭ 17 (-55.26%)
Ethereum Etl Airflow
Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. What datasets do you want added to Ethereum ETL? Vote here: https://blockchain-etl.convas.io.
Stars: ✭ 89 (+134.21%)
Metl
mito ETL tool
Stars: ✭ 153 (+302.63%)
Mara Example Project 2
An example mini data warehouse for Python project stats, template for new projects
Stars: ✭ 154 (+305.26%)
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+4784.21%)
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+50%)
qwery
A SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-26.32%)
kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (-28.95%)
Etlalchemy
Extract, Transform, Load: any SQL database in 4 lines of code.
Stars: ✭ 460 (+1110.53%)
starlake
Starlake is a Spark-based on-premise and cloud ELT/ETL framework for batch and stream processing
Stars: ✭ 16 (-57.89%)
Stetl
Stetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (+68.42%)
Transformalize
Configurable Extract, Transform, and Load
Stars: ✭ 125 (+228.95%)
Datashare Toolkit
DIY commercial datasets on Google Cloud Platform
Stars: ✭ 41 (+7.89%)
argon
Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector
Stars: ✭ 31 (-18.42%)
Etlbox
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+434.21%)
gcp-ml
Google Cloud Platform Machine Learning Samples
Stars: ✭ 31 (-18.42%)
Bender
Bender - Serverless ETL Framework
Stars: ✭ 171 (+350%)
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (+278.95%)
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-47.37%)
bigflow
A Python framework for data processing on GCP.
Stars: ✭ 96 (+152.63%)
server-ip-addresses
Daily updated list of IP addresses / CIDR blocks used by data centers, cloud service providers, servers, etc.
Stars: ✭ 74 (+94.74%)
gcp
GCP learning material.
Stars: ✭ 36 (-5.26%)
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-63.16%)
nr1-cloud-optimize
NR1 Cloud Optimize allows you to identify right-sizing opportunities and potential savings for your AWS, GCP, and Azure instances across your cloud environment.
Stars: ✭ 38 (+0%)
pontem
Open source tools for Google Cloud Storage and Databases.
Stars: ✭ 62 (+63.16%)
google-cloud
A collection of Google Cloud Platform (GCP) plugins
Stars: ✭ 34 (-10.53%)
gtoken
Securely access AWS services from a GKE cluster
Stars: ✭ 43 (+13.16%)
sql-to-redis
🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (-52.63%)