beneathBeneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (+170.83%)
Mutual labels: etl, data-engineering, data-pipelines
versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+500%)
Mutual labels: etl, data-engineering, data-pipelines
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+120.83%)
Mutual labels: airflow, etl, data-engineering
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (-16.67%)
Mutual labels: airflow, etl, data-engineering
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+95.83%)
Mutual labels: etl, data-engineering
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+137.5%)
Mutual labels: etl, data-engineering
neon-workshopA Pachyderm deep learning tutorial for conference workshops
Stars: ✭ 19 (-20.83%)
Mutual labels: data-engineering, data-pipelines
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (+83.33%)
Mutual labels: etl, data-engineering
hive-metastore-clientA client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (+54.17%)
Mutual labels: etl, data-engineering
viewflowViewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+358.33%)
Mutual labels: airflow, data-engineering
arthur-redshift-etlELT Code for your Data Warehouse
Stars: ✭ 22 (-8.33%)
Mutual labels: etl, data-engineering
rivery cliRivery CLI
Stars: ✭ 16 (-33.33%)
Mutual labels: etl, data-pipelines
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+220.83%)
Mutual labels: etl, data-engineering
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+2450%)
Mutual labels: etl, data-engineering
etl[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+1062.5%)
Mutual labels: etl, data-engineering
ml-in-productionThe practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
Stars: ✭ 29 (+20.83%)
Mutual labels: data-engineering, data-pipelines
airflow-dbt-pythonA collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (+362.5%)
Mutual labels: airflow, data-engineering
jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+4.17%)
Mutual labels: airflow, data-engineering
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+229.17%)
Mutual labels: airflow, etl