Top 10 data-lake open source projects

zeeqs
Query API for aggregated Zeebe data
jobAnalytics and search
The JobAnalytics system consumes data from multiple sources and provides information to both job hunters and recruiters.
hiveberg
Demonstration of a Hive Input Format for Iceberg
herd-mdl
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.
analyzing-reddit-sentiment-with-aws
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing Reddit comments in real time. A 100-200 level tutorial.
smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines