Soda SqlMetric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (+55.86%)
awesome-dbtA curated list of awesome dbt resources
Stars: ✭ 520 (+368.47%)
Udacity Data Engineering ProjectsFew projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+312.61%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (-81.98%)
viewflowViewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (-0.9%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-52.25%)
FastETLPlugins do Airflow para implementação de pipelines de dados
Stars: ✭ 31 (-72.07%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+614.41%)
dbt-on-airflowNo description or website provided.
Stars: ✭ 30 (-72.97%)
dbt-sugardbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
Stars: ✭ 139 (+25.23%)
jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-77.48%)
airflow-dbtApache Airflow integration for dbt
Stars: ✭ 233 (+109.91%)
Docker AirflowRepo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializing GCP while container booting. https://abhioncbr.github.io/docker-airflow/
Stars: ✭ 29 (-73.87%)
ElyraElyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+655.86%)
Data Science Stack Cookiecutter🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (+37.84%)
ObjinsyncContinuously synchronize directories from remote object store to local filesystem
Stars: ✭ 29 (-73.87%)
WhirlFast iterative local development and testing of Apache Airflow workflows
Stars: ✭ 111 (+0%)
Airflow ExporterAirflow plugin to export dag and task based metrics to Prometheus.
Stars: ✭ 161 (+45.05%)
DatabookA facebook for data
Stars: ✭ 26 (-76.58%)
Aws Ecs AirflowRun Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-3.6%)
Incubator DolphinschedulerApache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
Stars: ✭ 6,916 (+6130.63%)
Airflow ChartA Helm chart to install Apache Airflow on Kubernetes
Stars: ✭ 137 (+23.42%)
AirflowApache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: ✭ 24,101 (+21612.61%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+272.07%)
Dag FactoryDynamically generate Apache Airflow DAGs from YAML configuration files
Stars: ✭ 385 (+246.85%)
Aws Airflow StackTurbine: the bare metals that gets you Airflow
Stars: ✭ 352 (+217.12%)
PaperboyA web frontend for scheduling Jupyter notebook reports
Stars: ✭ 221 (+99.1%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+976.58%)
Airflow Rest Api PluginA plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces
Stars: ✭ 281 (+153.15%)
Terraform Aws AirflowTerraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker with CeleryExecutor
Stars: ✭ 69 (-37.84%)
Airflow OperatorKubernetes custom controller and CRDs to managing Airflow
Stars: ✭ 278 (+150.45%)
Beyond Jupyter🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (+21.62%)
DiscreetlyETLy is an add-on dashboard service on top of Apache Airflow.
Stars: ✭ 60 (-45.95%)
helpdeskYet another helpdesk based on multiple providers
Stars: ✭ 14 (-87.39%)
Airflow CookbookAirflow workflow management platform chef cookbook.
Stars: ✭ 58 (-47.75%)
bigkubeMinikube for big data with Scala and Spark
Stars: ✭ 16 (-85.59%)
Airflow TestingAirflow Unit Tests and Integration Tests
Stars: ✭ 175 (+57.66%)
Airflow PipelineAn Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (+15.32%)
XeneA distributed workflow runner focusing on performance and simplicity.
Stars: ✭ 56 (-49.55%)
ap-airflowAstronomer Core Docker Images
Stars: ✭ 87 (-21.62%)
Airflow ToolkitAny Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) 🖥 >> [ 🚀, 🚢 ]
Stars: ✭ 51 (-54.05%)