Top 13 data-pipelines open source projects

Hub
Dataset format for AI. Build, manage, and visualize datasets for deep learning. Stream data in real time to PyTorch/TensorFlow and version-control it. https://activeloop.ai
spark-transformers
Spark-Transformers: Library for exporting Apache Spark MLlib models for use in any Java application with no other dependencies.
ml-in-production
Practical use cases showing how to make your machine learning pipelines robust and reliable using Apache Airflow.
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
CogStack-NiFi
Building data processing pipelines for document processing with NLP using Apache NiFi and related services.
smart-data-lake
Smart automation tool for building modern data lakes and data pipelines.