Igela delightful machine learning tool that allows you to train, test, and use models without writing code
HyperGBMA full pipeline AutoML tool for tabular data
PyTOUGHA Python library for automating TOUGH2 simulations of subsurface fluid and heat flow
podiumPodium: a framework agnostic Python NLP library for data loading and preprocessing
BrainPrepPreprocessing pipeline on Brain MR Images through FSL and ANTs, including registration, skull-stripping, bias field correction, enhancement and segmentation.
dropEstPipeline for initial analysis of droplet-based single-cell RNA-seq data
pywedgeMakes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking
skippaSciKIt-learn Pipeline in PAndas
tweets-preprocessorRepo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
veridical-flowMaking it easier to build stable, trustworthy data-science pipelines.
oxygenjsThis a JavaScript Library for the Numerical Javascript and Machine Learning
sparklanesA lightweight data processing framework for Apache Spark
MLLabelUtils.jlUtility package for working with classification targets and label-encodings
SeqToolsA python library to manipulate and transform indexable data (lists, arrays, ...)
chariotDeliver the ready-to-train data to your NLP model.
Start majaTo process a Sentinel-2 time series with MAJA cloud detection and atmospheric correction processor
dmriprepdMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data. The transparent workflow dispenses of manual intervention, thereby ensuring the reproducibility of the results.
prospectrR package: Misc. Functions for Processing and Sample Selection of Spectroscopic Data
arraymancer-visionSimple library for image loading, preprocessing and visualization for working with arraymancer.
multi-imbalancePython package for tackling multi-class imbalance problems. http://www.cs.put.poznan.pl/mlango/publications/multiimbalance/
NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
preprocessyPython package for Customizable Data Preprocessing Pipelines
Machine LearningA repository of resources for understanding the concepts of machine learning/deep learning.
remote-dataloaderPyTorch DataLoader processed in multiple remote computation machines for heavy data processings
3D Ground SegmentationA ground segmentation algorithm for 3D point clouds based on the work described in “Fast segmentation of 3D point clouds: a paradigm on LIDAR data for Autonomous Vehicle Applications”, D. Zermas, I. Izzat and N. Papanikolopoulos, 2017. Distinguish between road and non-road points. Road surface extraction. Plane fit ground filter
AutoTSAutomated Time Series Forecasting
torcharrowHigh performance model preprocessing library on PyTorch