All Projects → GoogleCloudPlatform → bigquery-data-lineage

GoogleCloudPlatform / bigquery-data-lineage

Licence: Apache-2.0 license
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Programming Languages

java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to bigquery-data-lineage

document-processing-pipeline-for-regulated-industries
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Stars: ✭ 36 (-67.86%)
Mutual labels:  data-governance, data-lineage
datacatalog-tag-manager
Python package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
Stars: ✭ 17 (-84.82%)
Mutual labels:  bigdata, data-governance
data-lineage
Generate and Visualize Data Lineage from query history
Stars: ✭ 166 (+48.21%)
Mutual labels:  data-governance, data-lineage
sqllineage
SQL Lineage Analysis Tool powered by Python
Stars: ✭ 348 (+210.71%)
Mutual labels:  data-governance, data-lineage
bqv
The simplest tool to manage views of BigQuery.
Stars: ✭ 22 (-80.36%)
Mutual labels:  bigquery, bigdata
auto-data-tokenize
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Stars: ✭ 21 (-81.25%)
Mutual labels:  dataflow, data-governance
Repatch
Dispatch reducers
Stars: ✭ 516 (+360.71%)
Mutual labels:  dataflow, data-management
datasphere-service
an open source dataworks platform
Stars: ✭ 20 (-82.14%)
Mutual labels:  bigdata, data-governance
DataflowTemplates
Convenient Dataflow pipelines for transforming data between cloud data sources
Stars: ✭ 22 (-80.36%)
Mutual labels:  bigquery, dataflow
bigflow
A Python framework for data processing on GCP.
Stars: ✭ 96 (-14.29%)
Mutual labels:  bigquery, dataflow
Gcp Variant Transforms
GCP Variant Transforms
Stars: ✭ 100 (-10.71%)
Mutual labels:  bigquery, dataflow
alphasql
AlphaSQL provides Integrated Type and Schema Check and Parallelization for SQL file set mainly for BigQuery
Stars: ✭ 35 (-68.75%)
Mutual labels:  bigquery, zetasql
Scio
A Scala API for Apache Beam and Google Cloud Dataflow.
Stars: ✭ 2,247 (+1906.25%)
Mutual labels:  bigquery, dataflow
workflUX
An open-source, cloud-ready web application for simplified deployment of big data workflows.
Stars: ✭ 26 (-76.79%)
Mutual labels:  bigdata
hayabusa
Hayabusa: Simple and Fast Full-Text Search Engine for Massive System Log Data
Stars: ✭ 43 (-61.61%)
Mutual labels:  bigdata
Dnai.Editor
Dnai Editor - Visual Scripting (Node Editor)
Stars: ✭ 117 (+4.46%)
Mutual labels:  dataflow
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (-69.64%)
Mutual labels:  bigdata
maritime-charting-sample-scripts
Sample scripts and models to automate work in ArcGIS for Maritime: Charting
Stars: ✭ 19 (-83.04%)
Mutual labels:  data-management
raster-tiles-compactcache
Compact Cache V2 is used by ArcGIS to store raster tiles. The bundle file structure is very simple and optimized for quick access, resulting in improved performance over alternative formats.
Stars: ✭ 49 (-56.25%)
Mutual labels:  data-management
Pandas Gbq
Pandas Google BigQuery
Stars: ✭ 243 (+116.96%)
Mutual labels:  bigquery
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].