All Projects → opentrials-airflow → Similar Projects or Alternatives

118 Open source projects that are alternatives of or similar to opentrials-airflow

AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (+11.11%)
Mutual labels:  airflow, data-pipeline
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+38.89%)
Mutual labels:  airflow, data-pipeline
Airflow Exporter
Airflow plugin to export dag and task based metrics to Prometheus.
Stars: ✭ 161 (+794.44%)
Mutual labels:  airflow
fab-oidc
Flask-AppBuilder SecurityManager for OpenIDConnect
Stars: ✭ 28 (+55.56%)
Mutual labels:  airflow
Whirl
Fast iterative local development and testing of Apache Airflow workflows
Stars: ✭ 111 (+516.67%)
Mutual labels:  airflow
Airflow Scheduler Failover Controller
A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability
Stars: ✭ 204 (+1033.33%)
Mutual labels:  airflow
airflow-site
Apache Airflow Website
Stars: ✭ 95 (+427.78%)
Mutual labels:  airflow
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (+650%)
Mutual labels:  airflow
apache-airflow-cloudera-parcel
Parcel for Apache Airflow
Stars: ✭ 16 (-11.11%)
Mutual labels:  airflow
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (+394.44%)
Mutual labels:  airflow
aircan
💨🥫 A Data Factory system for running data processing pipelines built on AirFlow and tailored to CKAN. Includes evolution of DataPusher and Xloader for loading data to DataStore.
Stars: ✭ 24 (+33.33%)
Mutual labels:  airflow
Terraform Aws Airflow
Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker with CeleryExecutor
Stars: ✭ 69 (+283.33%)
Mutual labels:  airflow
Paperboy
A web frontend for scheduling Jupyter notebook reports
Stars: ✭ 221 (+1127.78%)
Mutual labels:  airflow
saisoku
Saisoku is a Python module that helps you build complex pipelines of batch file/directory transfer/sync jobs.
Stars: ✭ 40 (+122.22%)
Mutual labels:  data-pipeline
Airflow Testing
Airflow Unit Tests and Integration Tests
Stars: ✭ 175 (+872.22%)
Mutual labels:  airflow
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+116.67%)
Mutual labels:  data-pipeline
Airflow Chart
A Helm chart to install Apache Airflow on Kubernetes
Stars: ✭ 137 (+661.11%)
Mutual labels:  airflow
kedro-airflow
Kedro-Airflow makes it easy to deploy Kedro projects to Airflow.
Stars: ✭ 121 (+572.22%)
Mutual labels:  airflow
Telemetry Airflow
Airflow configuration for Telemetry
Stars: ✭ 125 (+594.44%)
Mutual labels:  airflow
airflow-client-python
Apache Airflow - OpenApi Client for Python
Stars: ✭ 172 (+855.56%)
Mutual labels:  airflow
Aws Ecs Airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (+494.44%)
Mutual labels:  airflow
kedro-airflow-k8s
Kedro Plugin to support running pipelines on Kubernetes using Airflow.
Stars: ✭ 22 (+22.22%)
Mutual labels:  airflow
Airflow Training
Airflow training for the crunch conf
Stars: ✭ 83 (+361.11%)
Mutual labels:  airflow
machine-learning-data-pipeline
Pipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (+22.22%)
Mutual labels:  data-pipeline
Airflow Cookbook
Airflow workflow management platform chef cookbook.
Stars: ✭ 58 (+222.22%)
Mutual labels:  airflow
aws-pdf-textract-pipeline
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
Stars: ✭ 141 (+683.33%)
Mutual labels:  data-pipeline
Airflow Toolkit
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) 🖥 >> [ 🚀, 🚢 ]
Stars: ✭ 51 (+183.33%)
Mutual labels:  airflow
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+1250%)
Mutual labels:  airflow
T-Watch
Real Time Twitter Sentiment Analysis Product
Stars: ✭ 20 (+11.11%)
Mutual labels:  airflow
Awesome Apache Airflow
Curated list of resources about Apache Airflow
Stars: ✭ 2,755 (+15205.56%)
Mutual labels:  airflow
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+194.44%)
Mutual labels:  airflow
Soda Sql
Metric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (+861.11%)
Mutual labels:  airflow
incremental training
Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'
Stars: ✭ 110 (+511.11%)
Mutual labels:  airflow
Airflow Doc Zh
📖 [译] Airflow 中文文档
Stars: ✭ 169 (+838.89%)
Mutual labels:  airflow
dbt-airflow-docker-compose
Execution of DBT models using Apache Airflow through Docker Compose
Stars: ✭ 76 (+322.22%)
Mutual labels:  airflow
Data Science Stack Cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (+750%)
Mutual labels:  airflow
Insight-GDELT-Feed
A way for home buyers to know about factors affecting a state
Stars: ✭ 43 (+138.89%)
Mutual labels:  airflow
Airflow Autoscaling Ecs
Airflow Deployment on AWS ECS Fargate Using Cloudformation
Stars: ✭ 136 (+655.56%)
Mutual labels:  airflow
Data-pipeline-project
Data pipeline project
Stars: ✭ 18 (+0%)
Mutual labels:  data-pipeline
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (+611.11%)
Mutual labels:  airflow
dc-sdk-js
一个基于浏览器环境的数据采集SDK
Stars: ✭ 52 (+188.89%)
Mutual labels:  data-pipeline
Afctl
afctl helps to manage and deploy Apache Airflow projects faster and smoother.
Stars: ✭ 116 (+544.44%)
Mutual labels:  airflow
FastETL
Plugins do Airflow para implementação de pipelines de dados
Stars: ✭ 31 (+72.22%)
Mutual labels:  airflow
Airflow in docker compose
Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)
Stars: ✭ 109 (+505.56%)
Mutual labels:  airflow
datajob
Build and deploy a serverless data pipeline on AWS with no effort.
Stars: ✭ 101 (+461.11%)
Mutual labels:  data-pipeline
Bitnami Docker Airflow
Bitnami Docker Image for Apache Airflow
Stars: ✭ 89 (+394.44%)
Mutual labels:  airflow
k3ai
A lightweight tool to get an AI Infrastructure Stack up in minutes not days. K3ai will take care of setup K8s for You, deploy the AI tool of your choice and even run your code on it.
Stars: ✭ 105 (+483.33%)
Mutual labels:  airflow
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (+355.56%)
Mutual labels:  airflow
Dataspherestudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+6538.89%)
Mutual labels:  airflow
qunomon
Testbed of AI Systems Quality Management
Stars: ✭ 15 (-16.67%)
Mutual labels:  airflow
Discreetly
ETLy is an add-on dashboard service on top of Apache Airflow.
Stars: ✭ 60 (+233.33%)
Mutual labels:  airflow
pipeline
PipelineAI Kubeflow Distribution
Stars: ✭ 4,154 (+22977.78%)
Mutual labels:  airflow
Xene
A distributed workflow runner focusing on performance and simplicity.
Stars: ✭ 56 (+211.11%)
Mutual labels:  airflow
ob bulkstash
Bulk Stash is a docker rclone service to sync, or copy, files between different storage services. For example, you can copy files either to or from a remote storage services like Amazon S3 to Google Cloud Storage, or locally from your laptop to a remote storage.
Stars: ✭ 113 (+527.78%)
Mutual labels:  data-pipeline
Argo Workflows
Workflow engine for Kubernetes
Stars: ✭ 10,024 (+55588.89%)
Mutual labels:  airflow
scicloj.ml
A Clojure machine learning library
Stars: ✭ 152 (+744.44%)
Mutual labels:  data-pipeline
torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Stars: ✭ 165 (+816.67%)
Mutual labels:  airflow
airflow-boilerplate
A complete development environment setup for working with Airflow
Stars: ✭ 94 (+422.22%)
Mutual labels:  airflow
practical-data-engineering
Real estate dagster pipeline
Stars: ✭ 110 (+511.11%)
Mutual labels:  data-pipeline
fairflow
Functional Airflow DAG definitions.
Stars: ✭ 38 (+111.11%)
Mutual labels:  airflow
1-60 of 118 similar projects