Top 16 EMR open source projects

AWS Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatch Logs, DynamoDB, EMR, Secrets Manager, PostgreSQL, MySQL, SQL Server and S3 (Parquet, CSV, JSON and Excel).
basin
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser.
sensu-plugins-aws
This plugin provides native AWS instrumentation for monitoring and metrics collection, including health checks and metrics for AWS services such as EC2, RDS, and ELB, as well as handlers for EC2, SES, and SNS.
GooglePlay-Web-Crawler
MapReduce project built with Hadoop, Nutch, AWS EMR, Pig, Tez, and Hive
tscharts
Django REST framework-based Digital Patient Registration and EMR backend
pdd-graph
PDD Graph: Bridging MIMIC-III and the Linked Data Cloud
Hello-AWS-Data-Services
Sample code for AWS data services and ML courses on LinkedIn Learning
sbt-lighter
SBT plugin for Apache Spark on AWS EMR
terraform-emr-spark-example
An example Terraform project that configures a secure and customizable Spark cluster on Amazon EMR.
learning-hadoop-and-spark
Companion to the Learning Hadoop and Learning Spark courses on LinkedIn Learning