All Categories → Control Flow → airflow

Top 99 airflow open source projects

Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Paperboy
A web frontend for scheduling Jupyter notebook reports
Airflow Scheduler Failover Controller
A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability
Soda Sql
Metric collection, data testing and monitoring for SQL accessible data
Airflow Testing
Airflow Unit Tests and Integration Tests
Airflow Doc Zh
📖 [译] Airflow 中文文档
✭ 169
cssairflow
Airflow Exporter
Airflow plugin to export dag and task based metrics to Prometheus.
Data Science Stack Cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Airflow Chart
A Helm chart to install Apache Airflow on Kubernetes
Airflow Autoscaling Ecs
Airflow Deployment on AWS ECS Fargate Using Cloudformation
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Telemetry Airflow
Airflow configuration for Telemetry
Afctl
afctl helps to manage and deploy Apache Airflow projects faster and smoother.
Whirl
Fast iterative local development and testing of Apache Airflow workflows
Airflow in docker compose
Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)
Aws Ecs Airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Airflow Training
Airflow training for the crunch conf
Dataspherestudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Terraform Aws Airflow
Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker with CeleryExecutor
Discreetly
ETLy is an add-on dashboard service on top of Apache Airflow.
Airflow Cookbook
Airflow workflow management platform chef cookbook.
Xene
A distributed workflow runner focusing on performance and simplicity.
Airflow Toolkit
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) 🖥 >> [ 🚀, 🚢 ]
Data Pipelines With Apache Airflow
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
Airflow On Kubernetes
Bare minimal Airflow on Kubernetes (Local, EKS, AKS)
Objinsync
Continuously synchronize directories from remote object store to local filesystem
Docker Airflow
Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializing GCP while container booting. https://abhioncbr.github.io/docker-airflow/
Airflow Maintenance Dags
A series of DAGs/Workflows to help maintain the operation of Airflow
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Incubator Dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
Udacity Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Dag Factory
Dynamically generate Apache Airflow DAGs from YAML configuration files
Aws Airflow Stack
Turbine: the bare metals that gets you Airflow
Airflow Tutorial
Airflow basics tutorial
Airflow Rest Api Plugin
A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces
Airflow Operator
Kubernetes custom controller and CRDs to managing Airflow
dbt-on-airflow
No description or website provided.
airflow-user-management-plugin
A plugin for Apache Airflow that allows you to manage the users that can login
udacity-data-eng-proj2
A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract data from S3, apply a series of transformations and load into S3 and Redshift.
airflow-code-editor
A plugin for Apache Airflow that allows you to edit DAGs in browser
ecs-airflow
Cloudformation templates for deploying Airflow in ECS
1-60 of 99 airflow projects