Metlmito ETL tool
Stars: ✭ 153 (+139.06%)
DatavecETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (+325%)
EtlalchemyExtract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+618.75%)
Bulk WriterProvides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entites to table columns.
Stars: ✭ 210 (+228.13%)
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-68.75%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+242.19%)
qweryA SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-56.25%)
EtlboxA lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+217.19%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+23.44%)
Pyetlpython ETL framework
Stars: ✭ 33 (-48.44%)
lineageGenerate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-75%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+4.69%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-39.06%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-73.44%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-75%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (+96.88%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (+125%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+7585.94%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (+31.25%)
OpenKettleWebUI一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 138 (+115.63%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+856.25%)
mydataharbor🇨🇳 MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数据同步中间件。帮助用户可靠、快速、稳定的对海量数据进行准实时增量同步或者定时全量同步,主要定位是为实时交易系统服务,亦可用于大数据的数据同步(ETL领域)。
Stars: ✭ 28 (-56.25%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-62.5%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+464.06%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+481.25%)
Koop🔮 Transform, query, and download geospatial data on the web.
Stars: ✭ 505 (+689.06%)
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-60.94%)
vixtractwww.vixtract.ru
Stars: ✭ 40 (-37.5%)
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-40.62%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+2776.56%)
sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-73.44%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-67.19%)
Getting StartedThis repository is a getting started guide to Singer.
Stars: ✭ 734 (+1046.88%)
TransformalizeConfigurable Extract, Transform, and Load
Stars: ✭ 125 (+95.31%)
Openkettlewebui一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 125 (+95.31%)
BenderBender - Serverless ETL Framework
Stars: ✭ 171 (+167.19%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-50%)
etlM-Lab ingestion pipeline
Stars: ✭ 15 (-76.56%)
Go StreamsA lightweight stream processing library for Go
Stars: ✭ 615 (+860.94%)
GeohealthcheckService Status and QoS Checker for OGC Web Services
Stars: ✭ 52 (-18.75%)
Ensembl HiveEnsEMBL Hive - a system for creating and running pipelines on a distributed compute resource
Stars: ✭ 44 (-31.25%)
Otmaps基于ArcGIS API for JavaScript封装的专题图制图类库
Stars: ✭ 44 (-31.25%)
MetamorphMorphing mod for Minecraft 1.12.2
Stars: ✭ 52 (-18.75%)
Jenkins OsGroovy pipeline jobs that build and test Container Linux with Jenkins
Stars: ✭ 43 (-32.81%)
Jenkins Workflowcontains handy groovy workflow-libs scripts
Stars: ✭ 41 (-35.94%)
InlocoA Geographic Information System (GIS) used by Ministério Público do Estado do Rio de Janeiro to show social, institutional and administrative data , based on React and Leaflet, interacting with a GeoServer back-end.
Stars: ✭ 51 (-20.31%)
Ether sqlA python library to push ethereum blockchain data into an sql database.
Stars: ✭ 41 (-35.94%)
Shapefile.jlParsing .shp files in Julia
Stars: ✭ 40 (-37.5%)
Ee RunnerCommand-line runner for Google Earth Engine Playground scripts
Stars: ✭ 59 (-7.81%)
Drake ExamplesExample workflows for the drake R package
Stars: ✭ 57 (-10.94%)
3d TilesSpecification for streaming massive heterogeneous 3D geospatial datasets 🌎
Stars: ✭ 1,054 (+1546.88%)
AlchemistA realtime ETL engine
Stars: ✭ 40 (-37.5%)
RiversData Stream Processing API for GO
Stars: ✭ 39 (-39.06%)
Dawn🌅 Dawn is a lightweight task management and build tool for front-end and nodejs.
Stars: ✭ 1,057 (+1551.56%)
Intro spatialrIntroduction to GIS and mapping in R with the sf package
Stars: ✭ 39 (-39.06%)