Top 112 data-augmentation open source projects

Mixup Generator
An implementation of "mixup: Beyond Empirical Risk Minimization"
Zeroth
Kaldi-based Korean ASR (Korean speech recognition) open-source project
Nlp Data Augmentation
Data augmentation for NLP.
Syndata Generation
Code for generating synthetic scenes and bounding box annotations for object detection, used to produce the data in the Cut, Paste and Learn paper.
Scaper
A library for soundscape synthesis and augmentation
Tensorflow Mnist Cnn
MNIST classification using a Convolutional Neural Network. Various techniques such as data augmentation, dropout, and batch normalization are implemented.
Tsaug
A Python package for time series augmentation
Muda
A library for augmenting annotated audio data
Torch videovision
Transforms for video datasets in PyTorch
Stylealign
[ICCV 2019] Aggregation via Separation: Boosting Facial Landmark Detector with Semi-Supervised Style Transition
Torch Audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Imagecorruptions
Python package to corrupt arbitrary images.
Evoskeleton
Official project website for the CVPR 2020 paper (Oral Presentation) "Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data"
Copy Paste Aug
Copy-paste augmentation for segmentation and detection tasks
Torchsample
High-Level Training, Data Augmentation, and Utilities for PyTorch
Ghost Free Shadow Removal
[AAAI 2020] Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN
Unsupervised Data Augmentation
Unofficial PyTorch Implementation of Unsupervised Data Augmentation.
Semsegpipeline
A simpler way of reading and augmenting image segmentation data into TensorFlow
Aaltd18
Data augmentation using synthetic data for time series classification with deep residual networks
All Conv Keras
All Convolutional Network (https://arxiv.org/abs/1412.6806#) implementation in Keras
What I Have Read
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
Fcn train
The code includes all the files you need for the FCN training stage
Cutmix
A ready-to-use PyTorch extension of unofficial CutMix implementations with improved performance.
Textattack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP
Wb color augmenter
WB color augmenter improves the accuracy of image classification and image semantic segmentation methods by emulating different WB effects (ICCV 2019) [Python & Matlab].
Synthetic Occlusion
Synthetic Occlusion Augmentation
Pose Adv Aug
Code for "Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation" (CVPR 2018)
Grand
Source code and dataset of the NeurIPS 2020 paper "Graph Random Neural Network for Semi-Supervised Learning on Graphs"
Pedestrian Synthesis Gan
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond
Dda
Differentiable Data Augmentation Library
Doccreator
DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation
Dips
NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation
Handwriting recogition using adversarial learning
[CVPR 2019] "Handwriting Recognition in Low-resource Scripts using Adversarial Learning", IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.
Nlp xiaojiang
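Natural language processing (NLP): Xiaojiang robot (retrieval-based chit-chat chatbot), BERT sentence embeddings and similarity (Sentence Similarity), XLNet sentence embeddings and similarity (text xlnet embedding), text classification, entity extraction (NER, bert+bilstm+crf), data augmentation (text augment, data enhance), synonymous sentence and synonym generation, sentence main-part extraction (mainpart), Chinese short-text similarity, text feature engineering, keras-http-service invocation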
Veri Artirma Data Augmentation
In this repo you can find practical applications of data augmentation.
Eda nlp
Data augmentation for NLP, presented at EMNLP 2019
Data Augmentation Review
A list of useful data augmentation resources, including some uncommon techniques, libraries, links to GitHub repos, papers, and more.
Inltk
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need
Eda nlp for chinese
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes.
Paddleclas
A treasure chest for image classification powered by PaddlePaddle
Random Erasing
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
Audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Nlpcda
One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda
Specaugment
A TensorFlow & PyTorch implementation of SpecAugment, the method introduced by Google Brain
Mixup
Implementation of the mixup training method
Amazon Forest Computer Vision
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks
Image augmentor
Data augmentation tool for images
Dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.