All Projects → bigquery-kafka-connect → Similar Projects or Alternatives

935 Open source projects that are alternatives of or similar to bigquery-kafka-connect

Ethereum Etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+5523.53%)
Mutual labels:  bigquery, etl, google-cloud
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-5.88%)
Mutual labels:  bigquery, etl, google-cloud
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+364.71%)
Mutual labels:  bigquery, etl
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+2023.53%)
Mutual labels:  big-data, etl
Esper Tv
Esper instance for TV news analysis
Stars: ✭ 37 (+117.65%)
Mutual labels:  big-data, google-cloud
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (+382.35%)
Mutual labels:  big-data, kafka-connect
argon
Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector
Stars: ✭ 31 (+82.35%)
Mutual labels:  bigquery, google-cloud
kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (+58.82%)
Mutual labels:  etl, kafka-connect
Kafka Connect
equivalent to kafka-connect 🔧 for nodejs ✨🐢🚀✨
Stars: ✭ 102 (+500%)
Mutual labels:  etl, kafka-connect
bigquery-to-datastore
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Stars: ✭ 56 (+229.41%)
Mutual labels:  bigquery, google-cloud
bqv
The simplest tool to manage views of BigQuery.
Stars: ✭ 22 (+29.41%)
Mutual labels:  bigquery, google-cloud
iris3
An upgraded and improved version of the Iris automatic GCP-labeling project
Stars: ✭ 38 (+123.53%)
Mutual labels:  bigquery, google-cloud
bandar-log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (+17.65%)
Mutual labels:  big-data, etl
bigtable
TypeScript Bigtable Client with 🔋🔋 included.
Stars: ✭ 13 (-23.53%)
Mutual labels:  big-data, google-cloud
Streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (+464.71%)
Mutual labels:  big-data, kafka-connect
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (+747.06%)
Mutual labels:  big-data, etl
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (+76.47%)
Mutual labels:  bigquery, etl
redis-connect-dist
Real-Time Event Streaming & Change Data Capture
Stars: ✭ 21 (+23.53%)
Mutual labels:  etl, connect
Smooks
An extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (+1623.53%)
Mutual labels:  big-data, etl
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+1282.35%)
Mutual labels:  big-data, etl
Kafka Ui
Open-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (+1252.94%)
Mutual labels:  big-data, kafka-connect
kuromoji-for-bigquery
Tokenize Japanese text on BigQuery with Kuromoji in Apache Beam/Google Dataflow at scale
Stars: ✭ 11 (-35.29%)
Mutual labels:  bigquery, google-cloud
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+123.53%)
Mutual labels:  bigquery, etl
Mara Example Project 2
An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (+805.88%)
Mutual labels:  bigquery, etl
Spark Bigquery Connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Stars: ✭ 126 (+641.18%)
Mutual labels:  bigquery, google-cloud
Scio
A Scala API for Apache Beam and Google Cloud Dataflow.
Stars: ✭ 2,247 (+13117.65%)
Mutual labels:  bigquery, google-cloud
Magnolify
A collection of Magnolia add-on modules
Stars: ✭ 81 (+376.47%)
Mutual labels:  bigquery, google-cloud
maxwell-sink
consume maxwell generated message from kafka,export it to another mysql.
Stars: ✭ 16 (-5.88%)
Mutual labels:  etl, kafka-connect
Bandar Log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (+11.76%)
Mutual labels:  big-data, etl
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+211.76%)
Mutual labels:  bigquery, etl
Aws Etl Orchestrator
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+1341.18%)
Mutual labels:  big-data, etl
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (+723.53%)
Mutual labels:  big-data, etl
Bitcoin Etl
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (+923.53%)
Mutual labels:  bigquery, etl
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+364.71%)
Mutual labels:  big-data, etl
ob google-bigquery
This service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about installation, configuration or ongoing maintenance related to an SDK environment. This can be helpful to those who would prefer to not to be responsible for those activities.
Stars: ✭ 43 (+152.94%)
Mutual labels:  bigquery, google-cloud
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+129.41%)
Mutual labels:  big-data, etl
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-5.88%)
Mutual labels:  bigquery, etl
pgsink
Logically replicate data out of Postgres into sinks (files, Google BigQuery, etc)
Stars: ✭ 53 (+211.76%)
Mutual labels:  bigquery
functions-framework-php
FaaS (Function as a service) framework for writing portable PHP functions
Stars: ✭ 186 (+994.12%)
Mutual labels:  google-cloud
kafka-connect-jenkins
Kafka Connect Connector for Jenkins Open Source Continuous Integration Tool
Stars: ✭ 29 (+70.59%)
Mutual labels:  kafka-connect
sparkucx
A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer
Stars: ✭ 32 (+88.24%)
Mutual labels:  big-data
activemodel-datastore
Ruby on Rails with Active Model and Google Cloud Datastore. Extracted from Agrimatics Aero.
Stars: ✭ 47 (+176.47%)
Mutual labels:  google-cloud
insightedge
InsightEdge Core
Stars: ✭ 22 (+29.41%)
Mutual labels:  big-data
libssh2.nim
Nim wrapper for libssh2
Stars: ✭ 25 (+47.06%)
Mutual labels:  connect
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-17.65%)
Mutual labels:  etl
emulator-tools
Google Cloud BigTable and PubSub emulator tools to make development a breeze
Stars: ✭ 16 (-5.88%)
Mutual labels:  google-cloud
rust-goauth
Crate for authenticating Server to Server Apps for Google Cloud Engine.
Stars: ✭ 20 (+17.65%)
Mutual labels:  google-cloud
OpenKettleWebUI
一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 138 (+711.76%)
Mutual labels:  etl
google-cloud-powershell
PowerShell cmdlets for the Google Cloud Platform
Stars: ✭ 120 (+605.88%)
Mutual labels:  google-cloud
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+13782.35%)
Mutual labels:  big-data
cloudberry
Big Data Visualization
Stars: ✭ 89 (+423.53%)
Mutual labels:  big-data
Php-Google-Vision-Api
Google Vision Api for PHP (https://cloud.google.com/vision/)
Stars: ✭ 61 (+258.82%)
Mutual labels:  google-cloud
siembol
An open-source, real-time Security Information & Event Management tool based on big data technologies, providing a scalable, advanced security analytics framework.
Stars: ✭ 153 (+800%)
Mutual labels:  big-data
sql-to-redis
🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (+5.88%)
Mutual labels:  etl
rastercube
rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-11.76%)
Mutual labels:  big-data
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+294.12%)
Mutual labels:  etl
Cloud-Service-Providers-Free-Tier-Overview
Comparing the free tier offers of the major cloud providers like AWS, Azure, GCP, Oracle etc.
Stars: ✭ 226 (+1229.41%)
Mutual labels:  google-cloud
incubator-liminal
Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+588.24%)
Mutual labels:  big-data
airavata-php-gateway
Mirror of Apache Airavata PHP Gateway
Stars: ✭ 15 (-11.76%)
Mutual labels:  big-data
google-cloud
A collection of Google Cloud Platform (GCP) plugins
Stars: ✭ 34 (+100%)
Mutual labels:  bigquery
1-60 of 935 similar projects