Gcp Data Engineer ExamStudy materials for the Google Cloud Professional Data Engineering Exam
Stars: ✭ 144 (+152.63%)
BenthosFancy stream processing made operationally mundane
Stars: ✭ 3,705 (+6400%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-7.02%)
iris3An upgraded and improved version of the Iris automatic GCP-labeling project
Stars: ✭ 38 (-33.33%)
datartDatart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (+1728.07%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+17.54%)
etl managerA python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-75.44%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+1010.53%)
Go StreamsA lightweight stream processing library for Go
Stars: ✭ 615 (+978.95%)
Tuna🐟 A streaming ETL for fish
Stars: ✭ 11 (-80.7%)
k8s-digesterAdd digests to container and init container images in Kubernetes pod and pod template specs. Use either as a mutating admission webhook, or as a client-side KRM function with kpt or kustomize.
Stars: ✭ 65 (+14.04%)
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-17.54%)
GcloudGitHub Action for interacting with Google Cloud Platform (GCP)
Stars: ✭ 153 (+168.42%)
beneathBeneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (+14.04%)
DataformDataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+500%)
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-33.33%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (+121.05%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+8529.82%)
GoogleCloudLoggingSwift (Darwin) library for logging application events in Google Cloud.
Stars: ✭ 24 (-57.89%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (-3.51%)
Bitcoin EtlETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (+205.26%)
AirflowETLBlog post on ETL pipelines with Airflow
Stars: ✭ 20 (-64.91%)
deploy-cloudrunThis action deploys your container image to Cloud Run.
Stars: ✭ 238 (+317.54%)
GcpsketchnoteIf you are looking to become a Google Cloud Engineer , then you are at the right place. GCPSketchnote is series where I share Google Cloud concepts in quick and easy to learn format.
Stars: ✭ 2,631 (+4515.79%)
GothElixir package for Oauth authentication via Google Cloud APIs
Stars: ✭ 191 (+235.09%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+973.68%)
Unity SolutionsUse Firebase tools to incorporate common features into your games!
Stars: ✭ 95 (+66.67%)
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (-22.81%)
versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+152.63%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+35.09%)
SmooksAn extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (+414.04%)
GimmeCreating time bound IAM Conditions with ease and flair
Stars: ✭ 92 (+61.4%)
cloudenvoyCross-application messaging for Ruby and Rails using Google Cloud Pub/Sub
Stars: ✭ 31 (-45.61%)
Aws Data WranglerPandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+4084.21%)
RikoA Python stream processing engine modeled after Yahoo! Pipes
Stars: ✭ 1,571 (+2656.14%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+38.6%)
etl[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+389.47%)
gisjogjaGISJOGJA - aplikasi web based sistem informasi geografis (SIG) / GIS wisata kota JOGJA - www.firstplato.com
Stars: ✭ 17 (-70.18%)
GCPAll files containing commands which can be used to complete GCP quests and challenge labs
Stars: ✭ 46 (-19.3%)
course-materialCourse Material for in28minutes courses on Java, Spring Boot, DevOps, AWS, Google Cloud, and Azure.
Stars: ✭ 544 (+854.39%)
SaynData processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (+38.6%)
gcp authMinimal authentication library for Google Cloud Platform (GCP)
Stars: ✭ 42 (-26.32%)
deploy-appengineA GitHub Action that deploys source code to Google App Engine.
Stars: ✭ 184 (+222.81%)
cloud-speech-and-vision-demosA set of demo applications that make use of google speech, nlp and vision apis based in angular2
Stars: ✭ 35 (-38.6%)
daggerDagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.
Stars: ✭ 238 (+317.54%)
Fog GoogleFog for Google Cloud Platform
Stars: ✭ 83 (+45.61%)
Ethereum EtlPython scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+1577.19%)
authA GitHub Action for authenticating to Google Cloud.
Stars: ✭ 567 (+894.74%)