Airflow Testing: Airflow Unit Tests and Integration Tests
Stars: ✭ 175 (+525%)
udacity-data-eng-proj2: A production-grade data pipeline designed to automate the parsing of user search patterns to analyze user engagement. It extracts data from S3, applies a series of transformations, and loads the results into S3 and Redshift.
Stars: ✭ 25 (-10.71%)
Terraform Aws Airflow: Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs, and SQS as the message broker with CeleryExecutor
Stars: ✭ 69 (+146.43%)
ecs-airflow: CloudFormation templates for deploying Airflow on ECS
Stars: ✭ 37 (+32.14%)
token-cli: Command line utility for interacting with OAuth2 infrastructure to generate tokens
Stars: ✭ 19 (-32.14%)
openverse-catalog: Identifies and collects data on CC-licensed content across web crawl data and public APIs.
Stars: ✭ 27 (-3.57%)
Airflow Cookbook: Chef cookbook for the Airflow workflow management platform.
Stars: ✭ 58 (+107.14%)
airflow-tutorial: Use Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (+242.86%)
Airflow Exporter: Airflow plugin to export DAG and task based metrics to Prometheus.
Stars: ✭ 161 (+475%)
Airflow Toolkit: On day 1 of any Airflow project, you can spin up a local desktop Kubernetes Airflow environment and one in Google Cloud Composer with tested data pipelines (DAGs) 🖥 >> [ 🚀, 🚢 ]
Stars: ✭ 51 (+82.14%)
AirflowETL: Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-28.57%)
viewflow: Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+292.86%)
Data Pipelines With Apache Airflow: Developed a data pipeline to automate data warehouse ETL by building custom Airflow operators that handle the extraction, transformation, validation, and loading of data from S3 -> Redshift -> S3
Stars: ✭ 50 (+78.57%)
ml-ops: Get your MLOps (Level 1) platform started and going fast.
Stars: ✭ 81 (+189.29%)
Airflow Chart: A Helm chart to install Apache Airflow on Kubernetes
Stars: ✭ 137 (+389.29%)
Objinsync: Continuously synchronize directories from a remote object store to the local filesystem
Stars: ✭ 29 (+3.57%)
incubator-liminal: Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment, and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+317.86%)
airflow-dbt-python: A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (+296.43%)
opentrials-airflow: Configuration and definitions of Airflow for OpenTrials
Stars: ✭ 18 (-35.71%)
FastETL: Airflow plugins for implementing data pipelines
Stars: ✭ 31 (+10.71%)
Beyond Jupyter: 🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (+382.14%)
Databook: A Facebook for data
Stars: ✭ 26 (-7.14%)
qunomon: Testbed of AI Systems Quality Management
Stars: ✭ 15 (-46.43%)
dex-operator: A Kubernetes operator for Dex
Stars: ✭ 16 (-42.86%)
fairflow: Functional Airflow DAG definitions.
Stars: ✭ 38 (+35.71%)
incremental training: Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow and MLFlow'
Stars: ✭ 110 (+292.86%)
Incubator Dolphinscheduler: Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs out of the box.
Stars: ✭ 6,916 (+24600%)
Insight-GDELT-Feed: A way for home buyers to know about factors affecting a state
Stars: ✭ 43 (+53.57%)
Paperboy: A web frontend for scheduling Jupyter notebook reports
Stars: ✭ 221 (+689.29%)
Openiddict Samples: ASP.NET Core, Microsoft.Owin/ASP.NET 4.x and JavaScript samples for OpenIddict
Stars: ✭ 214 (+664.29%)
Airflow: Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: ✭ 24,101 (+85975%)
Openiddict Core: Versatile OpenID Connect stack for ASP.NET Core and Microsoft.Owin (compatible with ASP.NET 4.6.1)
Stars: ✭ 2,275 (+8025%)
Whirl: Fast iterative local development and testing of Apache Airflow workflows
Stars: ✭ 111 (+296.43%)
Dag Factory: Dynamically generate Apache Airflow DAGs from YAML configuration files
Stars: ✭ 385 (+1275%)
aircan: 💨🥫 A Data Factory system for running data processing pipelines built on Airflow and tailored to CKAN. Includes evolution of DataPusher and Xloader for loading data to DataStore.
Stars: ✭ 24 (-14.29%)
Jose Jwt: Ultimate Javascript Object Signing and Encryption (JOSE) and JSON Web Token (JWT) Implementation for .NET and .NET Core
Stars: ✭ 692 (+2371.43%)
Angular Auth Oidc Client: npm package for OpenID Connect, OAuth Code Flow with PKCE, Refresh tokens, Implicit Flow
Stars: ✭ 577 (+1960.71%)
Aws Ecs Airflow: Run Airflow in AWS ECS (Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (+282.14%)
Portier Broker: Portier Broker reference implementation, written in Rust
Stars: ✭ 474 (+1592.86%)
Airflow Rest Api Plugin: A plugin for Apache Airflow that exposes REST endpoints for the command line interfaces
Stars: ✭ 281 (+903.57%)
helpdesk: Yet another helpdesk based on multiple providers
Stars: ✭ 14 (-50%)
dbt-on-airflow: No description or website provided.
Stars: ✭ 30 (+7.14%)
loginapp: Web application for Kubernetes CLI configuration with OIDC
Stars: ✭ 74 (+164.29%)
kedro-airflow-k8s: Kedro Plugin to support running pipelines on Kubernetes using Airflow.
Stars: ✭ 22 (-21.43%)
pipeline: PipelineAI Kubeflow Distribution
Stars: ✭ 4,154 (+14735.71%)
Soda Sql: Metric collection, data testing, and monitoring for SQL-accessible data
Stars: ✭ 173 (+517.86%)