cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-90.95%)
dlinkDinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
Stars: ✭ 1,535 (+561.64%)
chronicle-etl📜 A CLI toolkit for extracting and working with your digital history
Stars: ✭ 78 (-66.38%)
wikirepoPython based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-85.78%)
google-sheets-etlLive import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-93.53%)
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-91.38%)
zdh web大数据采集,抽取平台
Stars: ✭ 292 (+25.86%)
FlowMasterETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (-91.81%)
thainThain is a distributed flow schedule platform.
Stars: ✭ 81 (-65.09%)
KuiBaDBAnother OLAP database
Stars: ✭ 297 (+28.02%)
DataX-srcDataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (-90.95%)
proc-thatproc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (-89.22%)
functionsAn Open Source Serverless Platform
Stars: ✭ 44 (-81.03%)
YaEtlYet Another ETL in PHP
Stars: ✭ 60 (-74.14%)
starlakeStarlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-93.1%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (-76.29%)
aws-tailorAWS account provisioning and management service
Stars: ✭ 105 (-54.74%)
CloudFrontierMonitor the internet attack surface of various public cloud environments. Currently supports AWS, GCP, Azure, DigitalOcean and Oracle Cloud.
Stars: ✭ 102 (-56.03%)
django-data-migrationData migration framework for Django that migrates legacy data into your new django app
Stars: ✭ 18 (-92.24%)
iex-stocksETL for the IEX Stocks API
Stars: ✭ 19 (-91.81%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+5.6%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+1.29%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-86.21%)
dtd2mysqlMySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (-89.22%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-83.19%)
etl[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+20.26%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (-5.6%)
jsitemapgeneratorJava sitemap generator. This library generates a web sitemap, can ping Google, generate RSS feed, robots.txt and more with friendly, easy to use Java 8 functional style of programming
Stars: ✭ 38 (-83.62%)
lmdrouterGo HTTP router library for AWS API Gateway-invoked Lambda Functions
Stars: ✭ 121 (-47.84%)
zinggScalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+182.33%)
krawlerA minimalist (geospatial) ETL
Stars: ✭ 51 (-78.02%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-92.67%)
NBiNBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (-56.03%)
sync-engine-exampleSynchronization Algorithm Exploration: Techniques to synchronize a SQL database with external destinations.
Stars: ✭ 17 (-92.67%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (-91.38%)
zdh server数据采集平台zdh,etl 处理服务
Stars: ✭ 53 (-77.16%)
id3cData logistics system enabling real-time pathogen surveillance. Built for the Seattle Flu Study.
Stars: ✭ 21 (-90.95%)
csv-cruncherTreats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (-86.21%)
openrefine-batchShell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (-67.24%)
vixtractwww.vixtract.ru
Stars: ✭ 40 (-82.76%)
ShadowCloneUnleash the power of cloud
Stars: ✭ 224 (-3.45%)
metriqlThe metrics layer for your data. Join us at https://metriql.com/slack
Stars: ✭ 227 (-2.16%)
Example Airflow DagsExample DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+4.74%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-66.81%)
StoragetapperStorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (+0%)
awesome-integrationA curated list of awesome system integration software and resources.
Stars: ✭ 117 (-49.57%)
terraform-aws-lambda-functionA Terraform module for deploying and managing Lambda functions on Amazon Web Services (AWS). https://aws.amazon.com/lambda/
Stars: ✭ 37 (-84.05%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-75.43%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-77.16%)
testimonialJamstack app using Gatsby, Netlify, and FaunaDB.
Stars: ✭ 23 (-90.09%)