vixtractwww.vixtract.ru
Stars: ✭ 40 (+100%)
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various tasks and jobs on GCP and AWS.
Stars: ✭ 38 (+90%)
csvpluscsvplus extends the standard Go encoding/csv package with a fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+235%)
BETL-oldBETL. Metadata-driven ETL generation using T-SQL
Stars: ✭ 17 (-15%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+2960%)
DaFlowApache Spark-based Data Flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
Stars: ✭ 24 (+20%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake: SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations.
Stars: ✭ 39 (+95%)
EtlboxA lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+915%)
FlowMasterETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (-5%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (+0%)
TEAMThe Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.
Stars: ✭ 27 (+35%)
EtlalchemyExtract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+2200%)
alphasqlAlphaSQL provides Integrated Type and Schema Check and Parallelization for SQL file set mainly for BigQuery
Stars: ✭ 35 (+75%)
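OpenkettlewebuiA Kettle-based web scheduling and control platform for data processing. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.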
Stars: ✭ 125 (+525%)
TransformalizeConfigurable Extract, Transform, and Load
Stars: ✭ 125 (+525%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (+320%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (+530%)
qweryA SQL-like language for performing ETL transformations.
Stars: ✭ 28 (+40%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+9180%)
BimlFlex-CommunityCommunity-focused content to supplement working with BimlFlex.
Stars: ✭ 30 (+50%)
csv-cruncherTreats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (+60%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (+60%)
BenderBender - Serverless ETL Framework
Stars: ✭ 171 (+755%)
Metlmito ETL tool
Stars: ✭ 153 (+665%)
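OpenKettleWebUIA Kettle-based web scheduling and control platform for data processing. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.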
Stars: ✭ 138 (+590%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (+5%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-20%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (+620%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+1705%)
ChoetlETL Framework for .NET / C# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+1760%)
Getting StartedThis repository is a getting started guide to Singer.
Stars: ✭ 734 (+3570%)
tweetsOLAPingImplementing an end-to-end tweets ETL/analysis pipeline.
Stars: ✭ 24 (+20%)
PyetlPython ETL framework
Stars: ✭ 33 (+65%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (+220%)
Bulk WriterProvides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entities to table columns.
Stars: ✭ 210 (+950%)
Omniparseromniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (+640%)
thainThain is a distributed flow schedule platform.
Stars: ✭ 81 (+305%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (+600%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+9105%)
CqlCategorical Query Language IDE
Stars: ✭ 196 (+880%)
Kettle WebCalls Kettle from Java code, based on Spring Boot
Stars: ✭ 128 (+540%)
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (+545%)
ExtractA cross-platform command line tool for parallelised content extraction and analysis.
Stars: ✭ 188 (+840%)
Etl.netMass data processing with a complete ETL for .NET developers
Stars: ✭ 129 (+545%)
Mongo EsA MongoDB to Elasticsearch connector
Stars: ✭ 185 (+825%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+1125%)
MetlMetl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Stars: ✭ 185 (+825%)
Aws Data WranglerPandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+11825%)
RikoA Python stream processing engine modeled after Yahoo! Pipes
Stars: ✭ 1,571 (+7755%)
KibaData processing & ETL framework for Ruby
Stars: ✭ 1,618 (+7990%)
Sentinel CrawlerXenomorph Crawler, a concise, declarative and observable distributed crawler (Node / Go / Java / Rust) for Web, RDB, and OS; it can also act as a monitor (with Prometheus) or ETL for infrastructure 💫 Multi-language executor, distributed crawler
Stars: ✭ 118 (+490%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (+175%)
Example Airflow DagsExample DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+1115%)