prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Market-Mix-ModelingMarket Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
foofahFoofah: programming-by-example data transformation program synthesizer
MoNuSACThis repository contains my implementations of the algorithms which MoNuSAC participants could use for data preparation to train their models at ISBI 2020.
bumblebee🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
reskitA library for creating and curating reproducible pipelines for scientific and industrial machine learning
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark