GobblinA distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
ibisIBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
hyperdriveExtensible streaming ingestion pipeline on top of Apache Spark
borrow-bot💰 A bot for maximizing the borrow subreddit
data-prepperData Prepper is a component of the OpenSearch project that accepts, filters, transforms, enriches, and routes data at scale.