Aws Airflow StackTurbine: the bare metals that gets you Airflow
Stars: ✭ 352 (+101.14%)
bigkubeMinikube for big data with Scala and Spark
Stars: ✭ 16 (-90.86%)
Udacity Data Engineering ProjectsFew projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+161.71%)
DiscreetlyETLy is an add-on dashboard service on top of Apache Airflow.
Stars: ✭ 60 (-65.71%)
Airflow OperatorKubernetes custom controller and CRDs to managing Airflow
Stars: ✭ 278 (+58.86%)
Docker AirflowRepo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializing GCP while container booting. https://abhioncbr.github.io/docker-airflow/
Stars: ✭ 29 (-83.43%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+353.14%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (-54.86%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+582.86%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+136%)
Afctlafctl helps to manage and deploy Apache Airflow projects faster and smoother.
Stars: ✭ 116 (-33.71%)
XeneA distributed workflow runner focusing on performance and simplicity.
Stars: ✭ 56 (-68%)
helpdeskYet another helpdesk based on multiple providers
Stars: ✭ 14 (-92%)
airflow-code-editorA plugin for Apache Airflow that allows you to edit DAGs in browser
Stars: ✭ 195 (+11.43%)
airflow-dbtApache Airflow integration for dbt
Stars: ✭ 233 (+33.14%)
ElyraElyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+379.43%)
Incubator DolphinschedulerApache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
Stars: ✭ 6,916 (+3852%)
AirflowApache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: ✭ 24,101 (+13672%)
Terraform Aws AirflowTerraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker with CeleryExecutor
Stars: ✭ 69 (-60.57%)
Dag FactoryDynamically generate Apache Airflow DAGs from YAML configuration files
Stars: ✭ 385 (+120%)
Airflow ChartA Helm chart to install Apache Airflow on Kubernetes
Stars: ✭ 137 (-21.71%)
Airflow CookbookAirflow workflow management platform chef cookbook.
Stars: ✭ 58 (-66.86%)
Airflow Rest Api PluginA plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces
Stars: ✭ 281 (+60.57%)
WhirlFast iterative local development and testing of Apache Airflow workflows
Stars: ✭ 111 (-36.57%)
Airflow ToolkitAny Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) 🖥 >> [ 🚀, 🚢 ]
Stars: ✭ 51 (-70.86%)
dbt-on-airflowNo description or website provided.
Stars: ✭ 30 (-82.86%)
Airflow ExporterAirflow plugin to export dag and task based metrics to Prometheus.
Stars: ✭ 161 (-8%)
Data Pipelines With Apache AirflowDeveloped a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
Stars: ✭ 50 (-71.43%)
ap-airflowAstronomer Core Docker Images
Stars: ✭ 87 (-50.29%)
Aws Ecs AirflowRun Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-38.86%)
udacity-data-eng-proj2A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract data from S3, apply a series of transformations and load into S3 and Redshift.
Stars: ✭ 25 (-85.71%)
ObjinsyncContinuously synchronize directories from remote object store to local filesystem
Stars: ✭ 29 (-83.43%)
ecs-airflowCloudformation templates for deploying Airflow in ECS
Stars: ✭ 37 (-78.86%)
Beyond Jupyter🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-22.86%)
openverse-catalogIdentifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-84.57%)
airflow-tutorialUse Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (-45.14%)
DatabookA facebook for data
Stars: ✭ 26 (-85.14%)
Data Science Stack Cookiecutter🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (-12.57%)
Airflow PipelineAn Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-26.86%)