go-bqloaderbqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-87.3%)
kuromoji-for-bigqueryTokenize Japanese text on BigQuery with Kuromoji in Apache Beam/Google Dataflow at scale
Stars: ✭ 11 (-91.27%)
ob google-bigqueryThis service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about installation, configuration or ongoing maintenance related to an SDK environment. This can be helpful to those who would prefer to not to be responsible for those activities.
Stars: ✭ 43 (-65.87%)
iris3An upgraded and improved version of the Iris automatic GCP-labeling project
Stars: ✭ 38 (-69.84%)
bqvThe simplest tool to manage views of BigQuery.
Stars: ✭ 22 (-82.54%)
Cube.js📊 Cube — Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+9410.32%)
argonCampaign Manager 360 and Display & Video 360 Reports to BigQuery connector
Stars: ✭ 31 (-75.4%)
bigquery-to-datastoreExport a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Stars: ✭ 56 (-55.56%)
ScioA Scala API for Apache Beam and Google Cloud Dataflow.
Stars: ✭ 2,247 (+1683.33%)
Spark BigqueryGoogle BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-48.41%)
RedashMake Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+15889.68%)
Ethereum EtlPython scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+658.73%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (-35.71%)
ElephasDistributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+1107.14%)
Kinesis SqlKinesis Connector for Structured Streaming
Stars: ✭ 120 (-4.76%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1373.02%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-12.7%)
Professional ServicesCommon solutions and tools developed by Google Cloud's Professional Services team
Stars: ✭ 1,923 (+1426.19%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-14.29%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+1193.65%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+8930.16%)
LogigskA Linux based software package to control led's on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-15.08%)
Ruby DockerRuby runtime for Google Cloud Platform
Stars: ✭ 122 (-3.17%)
DeequDeequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Stars: ✭ 2,020 (+1503.17%)
Teammate AndroidA Team Management app for creating tournaments and games for various sports
Stars: ✭ 116 (-7.94%)
ArchivesparkAn Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (-11.9%)
Istio WorkshopIn this workshop, you'll learn how to install and configure Istio, an open source framework for connecting, securing, and managing microservices, on Google Kubernetes Engine, Google’s hosted Kubernetes product. You will also deploy an Istio-enabled multi-service application
Stars: ✭ 120 (-4.76%)
Lambda ArchApplying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-11.9%)
Spark AlchemyCollection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-3.17%)
ElassandraElassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+1177.78%)
Parquet IndexSpark SQL index for Parquet tables
Stars: ✭ 109 (-13.49%)
MaisUniversalizando o acesso a dados no Brasil. Docs: https://basedosdados.github.io/mais/
Stars: ✭ 122 (-3.17%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-14.29%)
Esp V2A service proxy that provides API management capabilities using Google Service Infrastructure.
Stars: ✭ 120 (-4.76%)
Microservices DemoSample cloud-native application with 10 microservices showcasing Kubernetes, Istio, gRPC and OpenCensus.
Stars: ✭ 11,369 (+8923.02%)
BeastLoad data from Kafka to any data warehouse
Stars: ✭ 119 (-5.56%)
Seldon ServerMachine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+1038.89%)
Spark Infotheoretic Feature SelectionThis package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
Stars: ✭ 123 (-2.38%)
Spark On K8s OperatorKubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1312.7%)
Spark LucenerddSpark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-9.52%)
SparktutorialSource code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (-16.67%)
SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-16.67%)
ZparkioBoiler plate framework to use Spark and ZIO together.
Stars: ✭ 121 (-3.97%)
Spring Shiro SparkSpring-Shiro-Spark是Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...的集成尝试
Stars: ✭ 114 (-9.52%)
TyphoonMinimal and free Kubernetes distribution with Terraform
Stars: ✭ 1,397 (+1008.73%)
Spark FfmFFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101 (-19.84%)
Kubernetes NexusRun Sonatype Nexus Repository Manager OSS on top of Kubernetes (GKE). Includes instructions for automated backups (GCS) and day-to-day usage.
Stars: ✭ 122 (-3.17%)
Xlearning Xdmlextremely distributed machine learning
Stars: ✭ 113 (-10.32%)