dbd: dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (+50%)
starlake: Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-20%)
Sql Runner: Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake
Stars: ✭ 68 (+240%)
Tbls: tbls is a CI-friendly tool for documenting a database, written in Go.
Stars: ✭ 940 (+4600%)
growthbook: Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+11610%)
dbt-ml-preprocessing: A SQL port of Python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Stars: ✭ 128 (+540%)
Redash: Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+100635%)
tellery: Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (+995%)
Ddlparse: DDL parse and convert to BigQuery JSON schema and DDL statements
Stars: ✭ 52 (+160%)
carto-spatial-extension: A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift and Postgres with Spatial Analytics capabilities
Stars: ✭ 131 (+555%)
tipoca-stream: Near real time cloud native data pipeline in AWS (CDC+Sink). Hosts code for RedshiftSink. RDS to RedshiftSink Pipeline with masking and reloading support.
Stars: ✭ 43 (+115%)
DataflowTemplates: Convenient Dataflow pipelines for transforming data between cloud data sources
Stars: ✭ 22 (+10%)
node-redshift: A simple collection of tools to help you get started with Amazon Redshift from node.js
Stars: ✭ 66 (+230%)
astro: Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+295%)
logica: Logica is a logic programming language that compiles to StandardSQL and runs on Google BigQuery.
Stars: ✭ 1,469 (+7245%)
firehoser: A wrapper around AWS Kinesis Firehose with retry logic and custom queuing behavior. Requires node >= 6.0.0
Stars: ✭ 22 (+10%)
polygon-etl: ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+165%)
pytest-mock-resources: Pytest fixtures that let you actually test code that depends on external resources (Postgres, Mongo, Redshift...).
Stars: ✭ 84 (+320%)
etlflow: EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various tasks and jobs on GCP and AWS.
Stars: ✭ 38 (+90%)
Rin: Rin is a Redshift data Importer by SQS messaging.
Stars: ✭ 27 (+35%)
dekart: GIS Visualisation for Amazon Athena and BigQuery
Stars: ✭ 131 (+555%)
target-and-market: A data-driven tool to identify the best candidates for a marketing campaign and optimize it.
Stars: ✭ 19 (-5%)
bqv: The simplest tool to manage views of BigQuery.
Stars: ✭ 22 (+10%)
gsc-logger: Google Search Console Logger for Google App Engine
Stars: ✭ 38 (+90%)
tag-manager: Website analytics, JavaScript error tracking + analytics, tag manager, data ingest endpoint creation (tracking pixels). GDPR + CCPA compliant.
Stars: ✭ 279 (+1295%)
sparkbq: Sparklyr extension package to connect to Google BigQuery
Stars: ✭ 16 (-20%)
public-datasets: The list of public blockchain datasets in BigQuery
Stars: ✭ 86 (+330%)
airflow-tutorial: Use Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (+380%)
amplitude-bigquery: Export your events from Amplitude to Google BigQuery/Google Cloud Storage
Stars: ✭ 28 (+40%)
go-bqloader: bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-20%)
objectiv-analytics: Powerful product analytics for data teams, with full control over data & models.
Stars: ✭ 399 (+1895%)
dbq: CLI tool to easily decorate BigQuery table names
Stars: ✭ 13 (-35%)
bigflow: A Python framework for data processing on GCP.
Stars: ✭ 96 (+380%)
argon: Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector
Stars: ✭ 31 (+55%)
spark-on-k8s-gcp-examples: Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub
Stars: ✭ 36 (+80%)
bigquery-to-datastore: Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Stars: ✭ 56 (+180%)
bigquery-geo-viz: Visualize Google BigQuery geospatial data using Google Maps Platform APIs
Stars: ✭ 68 (+240%)
bigquery-data-lineage: Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Stars: ✭ 112 (+460%)
iris3: An upgraded and improved version of the Iris automatic GCP-labeling project
Stars: ✭ 38 (+90%)
team-timesheets: Time tracking web app built as a replacement for old school timesheets.
Stars: ✭ 25 (+25%)
hive compared bq: hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
Stars: ✭ 27 (+35%)
flight2bq: RTLSDR ADS-B dump1090 to Google BigQuery
Stars: ✭ 33 (+65%)
Pandas Gbq: Pandas Google BigQuery
Stars: ✭ 243 (+1115%)
gcp-ml: Google Cloud Platform Machine Learning Samples
Stars: ✭ 31 (+55%)
jobAnalytics and search: The JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+25%)
pgsink: Logically replicate data out of Postgres into sinks (files, Google BigQuery, etc)
Stars: ✭ 53 (+165%)
Hadoop Connectors: Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Stars: ✭ 218 (+990%)
Mprove: Open source Business Intelligence tool 🎉
Stars: ✭ 212 (+960%)
google-cloud: A collection of Google Cloud Platform (GCP) plugins
Stars: ✭ 34 (+70%)
Geomancer: Automated feature engineering for geospatial data
Stars: ✭ 194 (+870%)
Bigquery Grafana: Google BigQuery Datasource Plugin for Grafana.
Stars: ✭ 188 (+840%)
kuromoji-for-bigquery: Tokenize Japanese text on BigQuery with Kuromoji in Apache Beam/Google Dataflow at scale
Stars: ✭ 11 (-45%)
simple-ddl-parser: Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) DDL files to JSON/Python dict with full information about columns (types, defaults, primary keys, etc.) and table properties, types, domains, etc.
Stars: ✭ 76 (+280%)