Top 9 data-preparation open source projects

prosto
Prosto is a data processing toolkit that radically changes how data is processed by relying heavily on functions and operations on functions, as an alternative to map-reduce and join-groupby.
MoNuSAC
This repository contains my implementations of algorithms that MoNuSAC participants could use to prepare data for training their models at ISBI 2020.
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
reskit
A library for creating and curating reproducible pipelines for scientific and industrial machine learning
machine-learning-data-pipeline
A pipeline module for parallel real-time data processing, for both developing machine learning models and running them in production.