All Projects → awslabs → Aws Serverless Data Lake Framework

awslabs / Aws Serverless Data Lake Framework

Licence: other
Enterprise-grade, production-hardened, serverless data lake on AWS

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Aws Serverless Data Lake Framework

Awesome Aws Workshops
(Unofficial) curated list of awesome workshops found around in the internet. As we all have been there, finding that workshop that you have just attended shouldn't be hard. The idea is to provide an easy central repository, in a collaborative way.
Stars: ✭ 302 (+68.72%)
Mutual labels:  aws, serverless, analytics
Deep Framework
Full-stack JavaScript Framework for Cloud-Native Web Applications (perfect for Serverless use cases)
Stars: ✭ 533 (+197.77%)
Mutual labels:  aws, serverless, framework
Dataform
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+91.06%)
Mutual labels:  analytics, etl, data-engineering
Serverless Analytics
Track website visitors with Serverless Analytics using Kinesis, Lambda, and TypeScript.
Stars: ✭ 219 (+22.35%)
Mutual labels:  aws, serverless, analytics
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-55.87%)
Mutual labels:  etl, data-engineering, framework
beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (-63.69%)
Mutual labels:  etl, analytics, data-engineering
Midway
🍔 A Node.js Serverless Framework for front-end/full-stack developers. Build the application for next decade. Works on AWS, Alibaba Cloud, Tencent Cloud and traditional VM/Container. Super easy integrate with React and Vue. 🌈
Stars: ✭ 5,080 (+2737.99%)
Mutual labels:  aws, serverless, framework
Aws Etl Orchestrator
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+36.87%)
Mutual labels:  aws, serverless, etl
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-55.87%)
Mutual labels:  analytics, etl, data-engineering
Aws Auto Terminate Idle Emr
AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-88.27%)
Mutual labels:  aws, serverless, etl
Middy
🛵 The stylish Node.js middleware engine for AWS Lambda
Stars: ✭ 2,592 (+1348.04%)
Mutual labels:  aws, serverless, framework
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+1232.4%)
Mutual labels:  aws, etl, data-engineering
Shep
A framework for building JavaScript Applications with AWS API Gateway and Lambda
Stars: ✭ 376 (+110.06%)
Mutual labels:  aws, serverless, framework
Panther
Detect threats with log data and improve cloud security posture
Stars: ✭ 885 (+394.41%)
Mutual labels:  aws, serverless, etl
Iopipe Js Core
Observe and develop serverless apps with confidence on AWS Lambda with Tracing, Metrics, Profiling, Monitoring, and more.
Stars: ✭ 123 (-31.28%)
Mutual labels:  aws, serverless, analytics
Booster
Booster Cloud Framework
Stars: ✭ 136 (-24.02%)
Mutual labels:  aws, serverless, framework
Archive aws Lambda Go Net
Network I/O interface for AWS Lambda Go runtime.
Stars: ✭ 151 (-15.64%)
Mutual labels:  aws, serverless
Data Engineering Nanodegree
Projects done in the Data Engineering Nanodegree by Udacity.com
Stars: ✭ 151 (-15.64%)
Mutual labels:  aws, data-engineering
Aws Amplify Workshop React
Building Serverless React Applications with AWS Amplify
Stars: ✭ 155 (-13.41%)
Mutual labels:  aws, serverless
Cartoonify
Deploy and scale serverless machine learning app - in 4 steps.
Stars: ✭ 157 (-12.29%)
Mutual labels:  aws, serverless

Serverless Data Lake Framework (SDLF)

An AWS Professional Service open source initiative | [email protected]

The Serverless Data Lake Framework (SDLF) is a collection of reusable artifacts aimed at accelerating the delivery of enterprise data lakes on AWS, shortening the deployment time to production from several months to a few weeks. It can be used by AWS teams, partners and customers to implement the foundational structure of a data lake following best practices.

Motivation

A data lake gives your organization agility. It provides a repository where consumers can quickly find the data they need and use it in their business projects. However, building a data lake can be complex; there’s a lot to think about beyond the storage of files. For example, how do you catalog the data so you know what you’ve stored? What ingestion pipelines do you need? How do you manage data quality? How do you keep the code for your transformations under source control? How do you manage development, test and production environments? Building a solution that addresses these use cases can take many weeks and this time can be better spent innovating with data and achieving business goals. The SDLF is a collection of production-hardened, best practice templates which accelerate your data lake implementation journey on AWS, so that you can focus on use cases that generate value for business.

Public References

AWS Serverless Data Lake Framework

Workshop

To quickly get started with SDLF, follow our workshop:

https://sdlf.workshop.aws/

Read The Docs

Ingestion/Processing Library

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].