All Projects → snowplow → snowplow-rdb-loader

snowplow / snowplow-rdb-loader

Licence: other
Stores Snowplow enriched events in Redshift

Programming Languages

scala
5932 projects
shell
77523 projects

Projects that are alternatives of or similar to snowplow-rdb-loader

Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+11257.14%)
Mutual labels:  redshift
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-23.81%)
Mutual labels:  redshift
node-redshift
A simple collection of tools to help you get started with Amazon Redshift from node.js
Stars: ✭ 66 (+214.29%)
Mutual labels:  redshift
Fluentmigrator
Fluent migrations framework for .NET
Stars: ✭ 2,636 (+12452.38%)
Mutual labels:  redshift
pytest-mock-resources
Pytest Fixtures that let you actually test against external resource (Postgres, Mongo, Redshift...) dependent code.
Stars: ✭ 84 (+300%)
Mutual labels:  redshift
go-localstack
Go Wrapper for using localstack
Stars: ✭ 56 (+166.67%)
Mutual labels:  redshift
Awesome Aws
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
Stars: ✭ 9,895 (+47019.05%)
Mutual labels:  redshift
chrome-snowplow-inspector
Web Extension for debugging Snowplow pixels.
Stars: ✭ 26 (+23.81%)
Mutual labels:  snowplow
objectiv-analytics
Powerful product analytics for data teams, with full control over data & models.
Stars: ✭ 399 (+1800%)
Mutual labels:  snowplow
firehoser
A wrapper around AWS Kinesis Firehose with retry logic and custom queuing behavior. Requires node >= 6.0.0
Stars: ✭ 22 (+4.76%)
Mutual labels:  redshift
snowplow-bigquery-loader
Loads Snowplow enriched events into Google BigQuery
Stars: ✭ 15 (-28.57%)
Mutual labels:  snowplow
Rin
Rin is a Redshift data Importer by SQS messaging.
Stars: ✭ 27 (+28.57%)
Mutual labels:  redshift
simple-ddl-parser
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, etc. & table properties, types, domains, etc.
Stars: ✭ 76 (+261.9%)
Mutual labels:  redshift
Yuniql
Free and open source schema versioning and database migration made natively with .NET Core.
Stars: ✭ 156 (+642.86%)
Mutual labels:  redshift
dbt-ml-preprocessing
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Stars: ✭ 128 (+509.52%)
Mutual labels:  redshift
Go Sct
A color temperature setting library and CLI that operates in a similar way to f.lux and Redshift.
Stars: ✭ 112 (+433.33%)
Mutual labels:  redshift
tipoca-stream
Near real time cloud native data pipeline in AWS (CDC+Sink). Hosts code for RedshiftSink. RDS to RedshiftSink Pipeline with masking and reloading support.
Stars: ✭ 43 (+104.76%)
Mutual labels:  redshift
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+19.05%)
Mutual labels:  redshift
dotfiles
⚫📁 Dotfiles
Stars: ✭ 23 (+9.52%)
Mutual labels:  redshift
Hello-AWS-Data-Services
Sample code for AWS data service and ML courses on LinkedIn Learning
Stars: ✭ 144 (+585.71%)
Mutual labels:  redshift

Relational Database Loader

Build Status Release License Scala Steward Coverage Status

Introduction

This project contains applications required to load Snowplow data into relational databases.

RDB Shredder

RDB Shredder is a Spark job which:

  1. Reads Snowplow enriched events from S3
  2. Extracts any unstructured event JSONs and context JSONs found
  3. Validates that these JSONs conform to schema
  4. Adds metadata to these JSONs to track their origins
  5. Writes these JSONs out to nested folders dependent on their schema

It is designed to be run downstream of the Enrich job.

RDB Loader

RDB Loader (previously known as StorageLoader) is a Scala application that runs in background, discovering data, produced by RDB Shredder from SQS queue and loading it into one of possible storage targets.

RDB Stream Shredder (experimental)

An application similar to RDB Shredder, but working without Apache Spark or EMR and reading directly from Kinesis Stream. Only Shredder or Stream Shredder should be used.

Find out more

Technical Docs Setup Guide Roadmap & Contributing
i1 i2 i3
Technical Docs Setup Guide Roadmap

Copyright and License

Snowplow Relational Database Loader is copyright 2012-2021 Snowplow Analytics Ltd.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this software except in compliance with the License.

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].