airflow-dbt-pythonA collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (+270%)
airflow-dbtApache Airflow integration for dbt
Stars: ✭ 233 (+676.67%)
spark-utilsUtility functions for dbt projects running on Spark
Stars: ✭ 19 (-36.67%)
airflow-ciApache Airflow CI pipeline
Stars: ✭ 18 (-40%)
FastETLPlugins do Airflow para implementação de pipelines de dados
Stars: ✭ 31 (+3.33%)
ecs-airflowCloudformation templates for deploying Airflow in ECS
Stars: ✭ 37 (+23.33%)
dbt-clickhouseThe Clickhouse plugin for dbt (data build tool)
Stars: ✭ 77 (+156.67%)
dbt2lookerGenerate lookml for views from dbt models
Stars: ✭ 119 (+296.67%)
dbt artifactsA dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts
Stars: ✭ 119 (+296.67%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+163.33%)
aircalVisualize Airflow's schedule by exporting future DAG runs as events to Google Calendar.
Stars: ✭ 66 (+120%)
udacity-data-eng-proj2A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract data from S3, apply a series of transformations and load into S3 and Redshift.
Stars: ✭ 25 (-16.67%)
opentrials-airflowConfiguration and definitions of Airflow for OpenTrials
Stars: ✭ 18 (-40%)
dbt-sugardbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
Stars: ✭ 139 (+363.33%)
pre-commit-dbt🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Stars: ✭ 149 (+396.67%)
qunomonTestbed of AI Systems Quality Management
Stars: ✭ 15 (-50%)
jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-16.67%)
k3aiA lightweight tool to get an AI Infrastructure Stack up in minutes not days. K3ai will take care of setup K8s for You, deploy the AI tool of your choice and even run your code on it.
Stars: ✭ 105 (+250%)
openverse-catalogIdentifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-10%)
fairflowFunctional Airflow DAG definitions.
Stars: ✭ 38 (+26.67%)
airflow-tutorialUse Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (+220%)
incremental trainingRepo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'
Stars: ✭ 110 (+266.67%)
snowflake-starterA _simple_ starter template for Snowflake Cloud Data Platform
Stars: ✭ 31 (+3.33%)
faldo more with dbt. fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Stars: ✭ 567 (+1790%)
ria-jitLightweight and performant dynamic binary translation for RISC–V code on x86–64
Stars: ✭ 38 (+26.67%)
incubator-liminalApache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+290%)
dbt-formatterFormatting for dbt jinja-flavored sql
Stars: ✭ 37 (+23.33%)
kuwalaKuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+1480%)
torchxTorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Stars: ✭ 165 (+450%)
airflow-code-editorA plugin for Apache Airflow that allows you to edit DAGs in browser
Stars: ✭ 195 (+550%)
airflow-boilerplateA complete development environment setup for working with Airflow
Stars: ✭ 94 (+213.33%)
lightdashAn open source alternative to Looker built using dbt. Made for analysts ❤️
Stars: ✭ 1,082 (+3506.67%)
telleryTellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (+630%)
dbt-invokeA CLI for creating, updating, and deleting dbt property files
Stars: ✭ 42 (+40%)
dbt-spotify-analyticsContainerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase
Stars: ✭ 92 (+206.67%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+76.67%)
dbt mlPackage for dbt that allows users to train, audit and use BigQuery ML models.
Stars: ✭ 41 (+36.67%)
dbt-ml-preprocessingA SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Stars: ✭ 128 (+326.67%)
dbt ad reportingFivetran's ad reporting dbt package. Combine your Facebook, Google, Pinterest, Linkedin, Twitter, Snapchat and Microsoft advertising spend using this package.
Stars: ✭ 68 (+126.67%)
ap-airflowAstronomer Core Docker Images
Stars: ✭ 87 (+190%)
T-WatchReal Time Twitter Sentiment Analysis Product
Stars: ✭ 20 (-33.33%)
viewflowViewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+266.67%)
metriqlThe metrics layer for your data. Join us at https://metriql.com/slack
Stars: ✭ 227 (+656.67%)
re-datare_data - fix data issues before your users & CEO would discover them 😊
Stars: ✭ 955 (+3083.33%)
bigkubeMinikube for big data with Scala and Spark
Stars: ✭ 16 (-46.67%)