All Projects → arthur-redshift-etl → Similar Projects or Alternatives

281 Open source projects that are alternatives of or similar to arthur-redshift-etl

koza
Data transformation framework for LinkML data models
Stars: ✭ 21 (-4.55%)
Mutual labels:  etl
dswarm
an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+159.09%)
Mutual labels:  etl
Everything-Tech
A collection of online resources to help you on your Tech journey.
Stars: ✭ 396 (+1700%)
Mutual labels:  data-engineering
mlbgameday
Multi-core processing of 'Gameday' data from Major League Baseball Advanced Media. Additional tools to parallelize large data sets and write them to a database.
Stars: ✭ 37 (+68.18%)
Mutual labels:  etl
wrangle
A data transformation package for deep learning with Autonomio, Keras and TensorFlow.
Stars: ✭ 15 (-31.82%)
Mutual labels:  etl
zdh server
数据采集平台zdh,etl 处理服务
Stars: ✭ 53 (+140.91%)
Mutual labels:  etl
flock
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (+954.55%)
Mutual labels:  etl
web-click-flow
网站点击流离线日志分析
Stars: ✭ 14 (-36.36%)
Mutual labels:  etl
oesophagus
Enterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)
Stars: ✭ 12 (-45.45%)
Mutual labels:  etl
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-27.27%)
Mutual labels:  etl
gamechanger-data
GAMECHANGER aspires to be the Department’s trusted solution for evidence-based, data-driven decision-making across the universe of DoD requirements
Stars: ✭ 17 (-22.73%)
Mutual labels:  etl
FlowMaster
ETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (-13.64%)
Mutual labels:  etl
ml-in-production
The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
Stars: ✭ 29 (+31.82%)
Mutual labels:  data-engineering
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+72.73%)
Mutual labels:  etl
iex-stocks
ETL for the IEX Stocks API
Stars: ✭ 19 (-13.64%)
Mutual labels:  etl
proc-that
proc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (+13.64%)
Mutual labels:  etl
mydataharbor
🇨🇳 MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数据同步中间件。帮助用户可靠、快速、稳定的对海量数据进行准实时增量同步或者定时全量同步,主要定位是为实时交易系统服务,亦可用于大数据的数据同步(ETL领域)。
Stars: ✭ 28 (+27.27%)
Mutual labels:  etl
datart
Datart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (+4636.36%)
Mutual labels:  data-engineering
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-27.27%)
Mutual labels:  etl
zdh web
大数据采集,抽取平台
Stars: ✭ 292 (+1227.27%)
Mutual labels:  etl
viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+400%)
Mutual labels:  data-engineering
openrefine-client
The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the command line interface (CLI) and is distributed as a convenient one-file-executable (Windows, Linux, Mac). It is also available via Docker Hub, PyPI and Binder.
Stars: ✭ 67 (+204.55%)
Mutual labels:  etl
prefect-saturn
Python client for using Prefect Cloud with Saturn Cloud
Stars: ✭ 15 (-31.82%)
Mutual labels:  data-engineering
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-27.27%)
Mutual labels:  etl
naas
⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+895.45%)
Mutual labels:  etl
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+2054.55%)
Mutual labels:  elt
google-sheets-etl
Live import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-31.82%)
Mutual labels:  etl
cubetl
CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-4.55%)
Mutual labels:  etl
get smarties
Dummy variable generation with fit/transform capabilities
Stars: ✭ 23 (+4.55%)
Mutual labels:  data-engineering
open-semantic-desktop-search
Virtual Machine for Desktop Search with Open Semantic Search
Stars: ✭ 22 (+0%)
Mutual labels:  etl
contessa
Easy way to define, execute and store quality rules for your data.
Stars: ✭ 17 (-22.73%)
Mutual labels:  data-engineering
neon-workshop
A Pachyderm deep learning tutorial for conference workshops
Stars: ✭ 19 (-13.64%)
Mutual labels:  data-engineering
papilo
DEPRECATED: Stream data processing micro-framework
Stars: ✭ 24 (+9.09%)
Mutual labels:  data-engineering
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-22.73%)
Mutual labels:  etl
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+245.45%)
Mutual labels:  etl
mik
The Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservations systems).
Stars: ✭ 32 (+45.45%)
Mutual labels:  etl
big-data-engineering-indonesia
A curated list of big data engineering tools, resources and communities.
Stars: ✭ 26 (+18.18%)
Mutual labels:  data-engineering
TEAM
The Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.
Stars: ✭ 27 (+22.73%)
Mutual labels:  etl
awesome-integration
A curated list of awesome system integration software and resources.
Stars: ✭ 117 (+431.82%)
Mutual labels:  etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-22.73%)
Mutual labels:  etl
neo4j-jdbc
JDBC driver for Neo4j
Stars: ✭ 110 (+400%)
Mutual labels:  etl
link-move
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (+45.45%)
Mutual labels:  etl
kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (+22.73%)
Mutual labels:  etl
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (+81.82%)
Mutual labels:  etl
lrmr
Less-Resilient MapReduce framework for Go
Stars: ✭ 32 (+45.45%)
Mutual labels:  data-engineering
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (+163.64%)
Mutual labels:  data-engineering
cardano-py
Python3 lib and cli for operating a Cardano Passive Node and using the API's. (PRE-ALPHA)
Stars: ✭ 17 (-22.73%)
Mutual labels:  etl
Kaggle-project-list
Summary of my projects on kaggle
Stars: ✭ 20 (-9.09%)
Mutual labels:  data-engineering
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+50%)
Mutual labels:  etl
dtd2mysql
MySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (+13.64%)
Mutual labels:  etl
dogETL
A lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (-31.82%)
Mutual labels:  etl
qsv
CSVs sliced, diced & analyzed.
Stars: ✭ 438 (+1890.91%)
Mutual labels:  data-engineering
sql-to-redis
🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (-18.18%)
Mutual labels:  etl
DataX-src
DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (-4.55%)
Mutual labels:  etl
Data-Engineering-Projects
Personal Data Engineering Projects
Stars: ✭ 167 (+659.09%)
Mutual labels:  data-engineering
chronicle-etl
📜 A CLI toolkit for extracting and working with your digital history
Stars: ✭ 78 (+254.55%)
Mutual labels:  etl
pentaho-gis-plugins
🗺 GIS plugins for Pentaho Data Integration
Stars: ✭ 42 (+90.91%)
Mutual labels:  etl
dbt-databricks
A dbt adapter for Databricks.
Stars: ✭ 115 (+422.73%)
Mutual labels:  etl
DQCS
数据质量控制系统
Stars: ✭ 34 (+54.55%)
Mutual labels:  etl
DIRECT
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-9.09%)
Mutual labels:  etl
61-120 of 281 similar projects