All Projects → Discreetly → Similar Projects or Alternatives

293 Open source projects that are alternatives of or similar to Discreetly

AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-66.67%)
Mutual labels:  airflow, etl
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+305%)
Mutual labels:  airflow, etl
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (+48.33%)
Mutual labels:  airflow, etl
Phila Airflow
Stars: ✭ 16 (-73.33%)
Mutual labels:  airflow, etl
Dataspherestudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+1891.67%)
Mutual labels:  airflow, etl
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+31.67%)
Mutual labels:  airflow, etl
Aws Ecs Airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (+78.33%)
Mutual labels:  airflow, etl
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-11.67%)
Mutual labels:  airflow, etl
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-60%)
Mutual labels:  airflow, etl
Koop
🔮 Transform, query, and download geospatial data on the web.
Stars: ✭ 505 (+741.67%)
Mutual labels:  etl
Dswarm Backoffice Web
The backoffice web application of d:swarm (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 11 (-81.67%)
Mutual labels:  etl
Udacity Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+663.33%)
Mutual labels:  airflow
Baby Names Analysis
Data ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'.
Stars: ✭ 557 (+828.33%)
Mutual labels:  etl
Aws Auto Terminate Idle Emr
AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-65%)
Mutual labels:  etl
Smartcode
SmartCode = IDataSource -> IBuildTask -> IOutput => Build Everything!!!
Stars: ✭ 464 (+673.33%)
Mutual labels:  etl
Airflow On Kubernetes
Bare minimal Airflow on Kubernetes (Local, EKS, AKS)
Stars: ✭ 38 (-36.67%)
Mutual labels:  airflow
Airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: ✭ 24,101 (+40068.33%)
Mutual labels:  airflow
Elyra
Elyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+1298.33%)
Mutual labels:  airflow
Datacleaner
The premier open source Data Quality solution
Stars: ✭ 391 (+551.67%)
Mutual labels:  etl
Abc
Power of appbase.io via CLI, with nifty imports from your favorite data sources
Stars: ✭ 375 (+525%)
Mutual labels:  etl
Data Pipelines With Apache Airflow
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
Stars: ✭ 50 (-16.67%)
Mutual labels:  airflow
Pyetl
python ETL framework
Stars: ✭ 33 (-45%)
Mutual labels:  etl
Bandar Log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-68.33%)
Mutual labels:  etl
Aistore
AIStore: scalable storage for AI applications
Stars: ✭ 367 (+511.67%)
Mutual labels:  etl
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+501.67%)
Mutual labels:  etl
Aws Airflow Stack
Turbine: the bare metals that gets you Airflow
Stars: ✭ 352 (+486.67%)
Mutual labels:  airflow
Dataform
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+470%)
Mutual labels:  etl
Incubator Dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
Stars: ✭ 6,916 (+11426.67%)
Mutual labels:  airflow
Yunmai Data Extract
Extract your data from the Yunmai weighing scales cloud API so you can use it elsewhere
Stars: ✭ 21 (-65%)
Mutual labels:  etl
Ananas Desktop
A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.
Stars: ✭ 551 (+818.33%)
Mutual labels:  etl
Ether sql
A python library to push ethereum blockchain data into an sql database.
Stars: ✭ 41 (-31.67%)
Mutual labels:  etl
Bigslice
A serverless cluster computing system for the Go programming language
Stars: ✭ 469 (+681.67%)
Mutual labels:  etl
Panther
Detect threats with log data and improve cloud security posture
Stars: ✭ 885 (+1375%)
Mutual labels:  etl
Etlalchemy
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+666.67%)
Mutual labels:  etl
Argo Workflows
Workflow engine for Kubernetes
Stars: ✭ 10,024 (+16606.67%)
Mutual labels:  airflow
Pglogical
Logical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
Stars: ✭ 455 (+658.33%)
Mutual labels:  etl
Tuna
🐟 A streaming ETL for fish
Stars: ✭ 11 (-81.67%)
Mutual labels:  etl
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+588.33%)
Mutual labels:  airflow
Configs
Public, free to use, repository with diggers configs for scraping / extracting data from various e-commerce websites and online stores
Stars: ✭ 37 (-38.33%)
Mutual labels:  etl
Dag Factory
Dynamically generate Apache Airflow DAGs from YAML configuration files
Stars: ✭ 385 (+541.67%)
Mutual labels:  airflow
Databook
A facebook for data
Stars: ✭ 26 (-56.67%)
Mutual labels:  airflow
Choetl
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+520%)
Mutual labels:  etl
Xene
A distributed workflow runner focusing on performance and simplicity.
Stars: ✭ 56 (-6.67%)
Mutual labels:  airflow
Wedatasphere
WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+520%)
Mutual labels:  etl
Automating Your Data Pipeline With Apache Airflow
Automating Your Data Pipeline with Apache Airflow
Stars: ✭ 19 (-68.33%)
Mutual labels:  airflow
Objinsync
Continuously synchronize directories from remote object store to local filesystem
Stars: ✭ 29 (-51.67%)
Mutual labels:  airflow
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+1221.67%)
Mutual labels:  airflow
Webkettle
基于web版kettle开发的一套分布式综合调度,管理,ETL开发的用户专业版B/S架构工具
Stars: ✭ 334 (+456.67%)
Mutual labels:  etl
Airflow Tutorial
Airflow basics tutorial
Stars: ✭ 305 (+408.33%)
Mutual labels:  airflow
Docker Airflow
Docker Apache Airflow
Stars: ✭ 3,375 (+5525%)
Mutual labels:  airflow
Kiba Plus
Kiba enhancement for Ruby ETL.
Stars: ✭ 47 (-21.67%)
Mutual labels:  etl
Ethereum Etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+1493.33%)
Mutual labels:  etl
Getting Started
This repository is a getting started guide to Singer.
Stars: ✭ 734 (+1123.33%)
Mutual labels:  etl
Smooks
An extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (+388.33%)
Mutual labels:  etl
Dagster
An orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+6731.67%)
Mutual labels:  etl
Monstache
a go daemon that syncs MongoDB to Elasticsearch in realtime
Stars: ✭ 736 (+1126.67%)
Mutual labels:  etl
Airflow Rest Api Plugin
A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces
Stars: ✭ 281 (+368.33%)
Mutual labels:  airflow
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+6075%)
Mutual labels:  etl
Docker Airflow
Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializing GCP while container booting. https://abhioncbr.github.io/docker-airflow/
Stars: ✭ 29 (-51.67%)
Mutual labels:  airflow
React Csv
React components to build CSV files on the fly basing on Array/literal object of data
Stars: ✭ 732 (+1120%)
Mutual labels:  etl
1-60 of 293 similar projects