airflow-tutorial: Use Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (+220%)
aircan: 💨🥫 A Data Factory system for running data processing pipelines built on AirFlow and tailored to CKAN. Includes evolution of DataPusher and Xloader for loading data to DataStore.
Stars: ✭ 24 (-20%)
Airflow Operator: Kubernetes custom controller and CRDs for managing Airflow
Stars: ✭ 278 (+826.67%)
snowflake-starter: A _simple_ starter template for Snowflake Cloud Data Platform
Stars: ✭ 31 (+3.33%)
PyRasgo: Helper code to interact with Rasgo via our SDK, PyRasgo
Stars: ✭ 39 (+30%)
Example Airflow Dags: Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+710%)
fal: do more with dbt. fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Stars: ✭ 567 (+1790%)
ria-jit: Lightweight and performant dynamic binary translation for RISC-V code on x86-64
Stars: ✭ 38 (+26.67%)
Soda Sql: Metric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (+476.67%)
incubator-liminal: Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+290%)
Data Science Stack Cookiecutter: 🐳📊🤓 Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyter, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (+410%)
dbt-formatter: Formatting for dbt jinja-flavored sql
Stars: ✭ 37 (+23.33%)
kuwala: Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations, together in one intuitive interface built with React Flow. In addition, we provide third-party data into data sc…
Stars: ✭ 474 (+1480%)
Airflow Pipeline: An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (+326.67%)
torchx: TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Stars: ✭ 165 (+450%)
Afctl: afctl helps to manage and deploy Apache Airflow projects faster and smoother.
Stars: ✭ 116 (+286.67%)
airflow-code-editor: A plugin for Apache Airflow that allows you to edit DAGs in the browser
Stars: ✭ 195 (+550%)
airflow-boilerplate: A complete development environment setup for working with Airflow
Stars: ✭ 94 (+213.33%)
lightdash: An open source alternative to Looker built using dbt. Made for analysts ❤️
Stars: ✭ 1,082 (+3506.67%)
dbt2looker: Generate LookML for views from dbt models
Stars: ✭ 119 (+296.67%)
tellery: Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (+630%)
Discreetly: ETLy is an add-on dashboard service on top of Apache Airflow.
Stars: ✭ 60 (+100%)
Xene: A distributed workflow runner focusing on performance and simplicity.
Stars: ✭ 56 (+86.67%)
Argo Workflows: Workflow engine for Kubernetes
Stars: ✭ 10,024 (+33313.33%)
polygon-etl: ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+76.67%)
Docker Airflow: Repo for building a Docker-based Airflow image. Containers support multiple features, such as writing logs to a local or S3 folder and initializing GCP while the container boots. https://abhioncbr.github.io/docker-airflow/
Stars: ✭ 29 (-3.33%)
dbt ml: Package for dbt that allows users to train, audit and use BigQuery ML models.
Stars: ✭ 41 (+36.67%)
Elyra: Elyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+2696.67%)
dbt-ml-preprocessing: A SQL port of Python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Stars: ✭ 128 (+326.67%)
dbt ad reporting: Fivetran's ad reporting dbt package. Combine your Facebook, Google, Pinterest, Linkedin, Twitter, Snapchat and Microsoft advertising spend using this package.
Stars: ✭ 68 (+126.67%)
Goodreads etl pipeline: An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+2543.33%)
ap-airflow: Astronomer Core Docker Images
Stars: ✭ 87 (+190%)
Udacity Data Engineering Projects: A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+1426.67%)
T-Watch: Real Time Twitter Sentiment Analysis Product
Stars: ✭ 20 (-33.33%)
Agile data code 2: Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+1276.67%)
viewflow: Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+266.67%)
Aws Airflow Stack: Turbine: the bare metals that gets you Airflow
Stars: ✭ 352 (+1073.33%)
metriql: The metrics layer for your data. Join us at https://metriql.com/slack
Stars: ✭ 227 (+656.67%)
re-data: re_data - fix data issues before your users & CEO would discover them 😊
Stars: ✭ 955 (+3083.33%)
bigkube: Minikube for big data with Scala and Spark
Stars: ✭ 16 (-46.67%)
Insight-GDELT-Feed: A way for home buyers to know about factors affecting a state
Stars: ✭ 43 (+43.33%)
ml-ops: Get your MLOps (Level 1) platform started and going fast.
Stars: ✭ 81 (+170%)
kedro-airflow: Kedro-Airflow makes it easy to deploy Kedro projects to Airflow.
Stars: ✭ 121 (+303.33%)