versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+860%)
beneathBeneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (+333.33%)
Mongo EsA MongoDB to Elasticsearch connector
Stars: ✭ 185 (+1133.33%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (+33.33%)
Usaspending ApiServer application to serve U.S. federal spending data via a RESTful API
Stars: ✭ 166 (+1006.67%)
Bulk WriterProvides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entites to table columns.
Stars: ✭ 210 (+1300%)
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (+33.33%)
Bitcoin EtlETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (+1060%)
wikirepoPython based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (+120%)
Metlmito ETL tool
Stars: ✭ 153 (+920%)
thainThain is a distributed flow schedule platform.
Stars: ✭ 81 (+440%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+12173.33%)
Etl2pcapngUtility that converts an .etl file containing a Windows network packet capture into .pcapng format.
Stars: ✭ 228 (+1420%)
CqlCategorical Query Language IDE
Stars: ✭ 196 (+1206.67%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (+113.33%)
Linq2dbLinq to database provider.
Stars: ✭ 2,211 (+14640%)
FlowMasterETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (+26.67%)
id3cData logistics system enabling real-time pathogen surveillance. Built for the Seattle Flu Study.
Stars: ✭ 21 (+40%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (+860%)
DataX-srcDataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (+40%)
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (+760%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+1533.33%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (+740%)
StoragetapperStorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (+1446.67%)
ElasticR client for the Elasticsearch HTTP API
Stars: ✭ 227 (+1413.33%)
neo4j-jdbcJDBC driver for Neo4j
Stars: ✭ 110 (+633.33%)
EtlboxA lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+1253.33%)
historyDownload and warehouse historical trading data
Stars: ✭ 28 (+86.67%)
ExtractA cross-platform command line tool for parallelised content extraction and analysis.
Stars: ✭ 188 (+1153.33%)
openrefine-batchShell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+406.67%)
MetlMetl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Stars: ✭ 185 (+1133.33%)
krawlerA minimalist (geospatial) ETL
Stars: ✭ 51 (+240%)
GrafterLinked Data & RDF Manufacturing Tools in Clojure
Stars: ✭ 174 (+1060%)
dtd2mysqlMySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (+66.67%)
BenderBender - Serverless ETL Framework
Stars: ✭ 171 (+1040%)
NBiNBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (+580%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+32693.33%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (+13.33%)
Open Semantic EtlPython based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (+1000%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (+266.67%)
Mara Example Project 2An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (+926.67%)
etl[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+1760%)
Omniparseromniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (+886.67%)
vixtractwww.vixtract.ru
Stars: ✭ 40 (+166.67%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (+833.33%)
iex-stocksETL for the IEX Stocks API
Stars: ✭ 19 (+26.67%)
Kettle Web基于spring boot通过java代码调用kette
Stars: ✭ 128 (+753.33%)
Etl.netMass processing data with a complete ETL for .net developers
Stars: ✭ 129 (+760%)
chronicle-etl📜 A CLI toolkit for extracting and working with your digital history
Stars: ✭ 78 (+420%)
TransformalizeConfigurable Extract, Transform, and Load
Stars: ✭ 125 (+733.33%)
Example Airflow DagsExample DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+1520%)
YaEtlYet Another ETL in PHP
Stars: ✭ 60 (+300%)
zdh server数据采集平台zdh,etl 处理服务
Stars: ✭ 53 (+253.33%)
awesome-integrationA curated list of awesome system integration software and resources.
Stars: ✭ 117 (+680%)