All Projects → polygon-etl → Similar Projects or Alternatives

774 Open source projects that are alternatives of or similar to polygon-etl

astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+49.06%)
Mutual labels:  bigquery, airflow, etl
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-28.3%)
Mutual labels:  bigquery, etl, gcp
Bitcoin Etl
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (+228.3%)
Mutual labels:  bigquery, etl, gcp
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+7.55%)
Mutual labels:  etl, gcp, data-engineering
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-54.72%)
Mutual labels:  airflow, etl, data-engineering
Ethereum Etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+1703.77%)
Mutual labels:  bigquery, etl, gcp
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-62.26%)
Mutual labels:  airflow, etl, data-engineering
bigflow
A Python framework for data processing on GCP.
Stars: ✭ 96 (+81.13%)
Mutual labels:  bigquery, gcp
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (+137.74%)
Mutual labels:  etl, data-engineering
Aws Serverless Data Lake Framework
Enterprise-grade, production-hardened, serverless data lake on AWS
Stars: ✭ 179 (+237.74%)
Mutual labels:  etl, data-engineering
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+45.28%)
Mutual labels:  etl, data-engineering
etl manager
A python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-73.58%)
Mutual labels:  etl, data-engineering
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+6890.57%)
Mutual labels:  etl, data-engineering
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+4400%)
Mutual labels:  etl, data-engineering
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+1094.34%)
Mutual labels:  etl, data-engineering
Around Dataengineering
A Data Engineering & Machine Learning Knowledge Hub
Stars: ✭ 257 (+384.91%)
Mutual labels:  airflow, data-engineering
Udacity Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+764.15%)
Mutual labels:  airflow, data-engineering
viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+107.55%)
Mutual labels:  airflow, data-engineering
hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
Stars: ✭ 16 (-69.81%)
Mutual labels:  bigquery, gcp
Phila Airflow
Stars: ✭ 16 (-69.81%)
Mutual labels:  airflow, etl
Discreetly
ETLy is an add-on dashboard service on top of Apache Airflow.
Stars: ✭ 60 (+13.21%)
Mutual labels:  airflow, etl
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (+54.72%)
Mutual labels:  airflow, data-engineering
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-69.81%)
Mutual labels:  bigquery, etl
iris3
An upgraded and improved version of the Iris automatic GCP-labeling project
Stars: ✭ 38 (-28.3%)
Mutual labels:  bigquery, gcp
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-67.92%)
Mutual labels:  bigquery, etl
airflow-tutorial
Use Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (+81.13%)
Mutual labels:  bigquery, airflow
beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (+22.64%)
Mutual labels:  etl, data-engineering
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (-58.49%)
Mutual labels:  etl, data-engineering
Dataform
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+545.28%)
Mutual labels:  etl, data-engineering
pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (+20.75%)
Mutual labels:  etl, data-engineering
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+49.06%)
Mutual labels:  etl, data-engineering
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (+49.06%)
Mutual labels:  etl, data-engineering
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+9181.13%)
Mutual labels:  etl, data-engineering
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-16.98%)
Mutual labels:  etl, data-engineering
argon
Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector
Stars: ✭ 31 (-41.51%)
Mutual labels:  bigquery, gcp
gcp-ml
Google Cloud Platform Machine Learning Samples
Stars: ✭ 31 (-41.51%)
Mutual labels:  bigquery, gcp
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (-43.4%)
Mutual labels:  bigquery, etl
Data-Engineering-Projects
Personal Data Engineering Projects
Stars: ✭ 167 (+215.09%)
Mutual labels:  airflow, data-engineering
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+1396.23%)
Mutual labels:  airflow, data-engineering
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-52.83%)
Mutual labels:  airflow, data-engineering
Everything-Tech
A collection of online resources to help you on your Tech journey.
Stars: ✭ 396 (+647.17%)
Mutual labels:  gcp, data-engineering
Airflow Toolkit
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) 🖥 >> [ 🚀, 🚢 ]
Stars: ✭ 51 (-3.77%)
Mutual labels:  airflow, gcp
Dataspherestudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+2154.72%)
Mutual labels:  airflow, etl
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+171.7%)
Mutual labels:  etl, data-engineering
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+358.49%)
Mutual labels:  airflow, etl
Soda Sql
Metric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (+226.42%)
Mutual labels:  airflow, data-engineering
Airflow Autoscaling Ecs
Airflow Deployment on AWS ECS Fargate Using Cloudformation
Stars: ✭ 136 (+156.6%)
Mutual labels:  airflow, data-engineering
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-69.81%)
Mutual labels:  bigquery, etl
growthbook
Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+4318.87%)
Mutual labels:  bigquery, data-engineering
Aws Ecs Airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (+101.89%)
Mutual labels:  airflow, etl
Datashare Toolkit
DIY commercial datasets on Google Cloud Platform
Stars: ✭ 41 (-22.64%)
Mutual labels:  bigquery, gcp
Ethereum Etl Airflow
Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. What datasets do you want to be added to Ethereum ETL? Vote here: https://blockchain-etl.convas.io.
Stars: ✭ 89 (+67.92%)
Mutual labels:  bigquery, gcp
awesome-bigquery-views
Useful SQL queries for Blockchain ETL datasets in BigQuery.
Stars: ✭ 325 (+513.21%)
Mutual labels:  gcp, data-engineering
Mara Example Project 2
An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (+190.57%)
Mutual labels:  bigquery, etl
airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (+109.43%)
Mutual labels:  airflow, data-engineering
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-11.32%)
Mutual labels:  etl, data-engineering
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1054.72%)
Mutual labels:  etl, data-engineering
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (+67.92%)
Mutual labels:  airflow, etl
snowplow-bigquery-loader
Loads Snowplow enriched events into Google BigQuery
Stars: ✭ 15 (-71.7%)
Mutual labels:  bigquery, gcp
hive-metastore-client
A client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (-30.19%)
Mutual labels:  etl, data-engineering
1-60 of 774 similar projects