sqllineageSQL Lineage Analysis Tool powered by Python
Stars: ✭ 348 (+210.71%)
datacatalog-tag-managerPython package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
Stars: ✭ 17 (-84.82%)
RepatchDispatch reducers
Stars: ✭ 516 (+360.71%)
ScioA Scala API for Apache Beam and Google Cloud Dataflow.
Stars: ✭ 2,247 (+1906.25%)
auto-data-tokenizeIdentify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Stars: ✭ 21 (-81.25%)
bqvThe simplest tool to manage views of BigQuery.
Stars: ✭ 22 (-80.36%)
data-lineageGenerate and Visualize Data Lineage from query history
Stars: ✭ 166 (+48.21%)
bigflowA Python framework for data processing on GCP.
Stars: ✭ 96 (-14.29%)
DataflowTemplatesConvenient Dataflow pipelines for transforming data between cloud data sources
Stars: ✭ 22 (-80.36%)
alphasqlAlphaSQL provides Integrated Type and Schema Check and Parallelization for SQL file set mainly for BigQuery
Stars: ✭ 35 (-68.75%)
BeastLoad data from Kafka to any data warehouse
Stars: ✭ 119 (+6.25%)
Dnai.EditorDnai Editor - Visual Scripting (Node Editor)
Stars: ✭ 117 (+4.46%)
Cube.js📊 Cube — Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+10599.11%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (-27.68%)
raster-tiles-compactcacheCompact Cache V2 is used by ArcGIS to store raster tiles. The bundle file structure is very simple and optimized for quick access, resulting in improved performance over alternative formats.
Stars: ✭ 49 (-56.25%)
Pandas GbqPandas Google BigQuery
Stars: ✭ 243 (+116.96%)
Linq To BigqueryLINQ to BigQuery is C# LINQ Provider for Google BigQuery. It also enables Desktop GUI Client with LINQPad and plug-in driver.
Stars: ✭ 69 (-38.39%)
Spark BigqueryGoogle BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-41.96%)
MproveOpen source Business Intelligence tool 🎉
Stars: ✭ 212 (+89.29%)
Datashare ToolkitDIY commercial datasets on Google Cloud Platform
Stars: ✭ 41 (-63.39%)
Pg2bqExport PostgreSQL tables to Google BigQuery
Stars: ✭ 30 (-73.21%)
Bigquery GrafanaGoogle BigQuery Datasource Plugin for Grafana.
Stars: ✭ 188 (+67.86%)
Professional ServicesCommon solutions and tools developed by Google Cloud's Professional Services team
Stars: ✭ 1,923 (+1616.96%)
workflUXAn open-source, cloud-ready web application for simplified deployment of big data workflows.
Stars: ✭ 26 (-76.79%)
hayabusaHayabusa: Simple and Fast Full-Text Search Engine for Massive System Log Data
Stars: ✭ 43 (-61.61%)
Ethereum Etl AirflowAirflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. What datasets do you want to be added to Ethereum ETL? Vote here: https://blockchain-etl.convas.io.
Stars: ✭ 89 (-20.54%)
Sql RunnerRun templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake
Stars: ✭ 68 (-39.29%)
Hadoop ConnectorsLibraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Stars: ✭ 218 (+94.64%)
DdlparseDDL parase and Convert to BigQuery JSON schema and DDL statements
Stars: ✭ 52 (-53.57%)
Ethereum EtlPython scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+753.57%)
GeomancerAutomated feature engineering for geospatial data
Stars: ✭ 194 (+73.21%)
Tblstbls is a CI-Friendly tool for document a database, written in Go.
Stars: ✭ 940 (+739.29%)
tag-managerWebsite analytics, JavaScript error tracking + analytics, tag manager, data ingest endpoint creation (tracking pixels). GDPR + CCPA compliant.
Stars: ✭ 279 (+149.11%)
DataflowtemplatesGoogle-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
Stars: ✭ 603 (+438.39%)
QuixQuix Notebook Manager
Stars: ✭ 184 (+64.29%)
BigrqueryAn interface to Google's BigQuery from R.
Stars: ✭ 430 (+283.93%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+1106.25%)
Bigquery PythonSimple Python client for interacting with Google BigQuery.
Stars: ✭ 397 (+254.46%)
Franchise🍟 a notebook sql client. what you get when have a lot of sequels.
Stars: ✭ 3,823 (+3313.39%)
RedashMake Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+17888.39%)
Bitcoin EtlETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (+55.36%)
SqlpadWeb-based SQL editor run in your own private cloud. Supports MySQL, Postgres, SQL Server, Vertica, Crate, ClickHouse, Trino, Presto, SAP HANA, Cassandra, Snowflake, BigQuery, SQLite, and more with ODBC
Stars: ✭ 4,113 (+3572.32%)
Bigquery UtilsUseful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Stars: ✭ 338 (+201.79%)
hive compared bqhive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
Stars: ✭ 27 (-75.89%)
Gpt2 Bert Reddit Bota bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models
Stars: ✭ 158 (+41.07%)
Almanac.httparchive.orgHTTP Archive's annual "State of the Web" report made by the web community
Stars: ✭ 310 (+176.79%)
PypinfoEasily view PyPI download statistics via Google's BigQuery.
Stars: ✭ 295 (+163.39%)
Mara Example Project 2An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (+37.5%)
Issue Label BotCode For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"
Stars: ✭ 292 (+160.71%)