All Projects → IR-GAN → Similar Projects or Alternatives

89 Open source projects that are alternatives of or similar to IR-GAN

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (+328.57%)

Mutual labels: automatic-speech-recognition, augmentation, room-impulse-response, synthetic-data

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+11252.38%)

Mutual labels: automatic-speech-recognition

genalog

Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

Stars: ✭ 234 (+1014.29%)

Mutual labels: synthetic-data

SegSwap

(CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery

Stars: ✭ 46 (+119.05%)

Mutual labels: synthetic-data

mix3d

Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021 Oral)

Stars: ✭ 183 (+771.43%)

Mutual labels: augmentation

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-38.1%)

Mutual labels: automatic-speech-recognition

mtss-gan

MTSS-GAN: Multivariate Time Series Simulation with Generative Adversarial Networks (by @firmai)

Stars: ✭ 77 (+266.67%)

Mutual labels: synthetic-data

cram

cram is a computational room acoustics module to simulate and explore various acoustic properties of a modeled space

Stars: ✭ 23 (+9.52%)

Mutual labels: room-impulse-response

CAP augmentation

Cut and paste augmentation for object detection and instance segmentation

Stars: ✭ 93 (+342.86%)

Mutual labels: augmentation

automatic speech recognition

Vietnamese Automatic Speech Recognition

Stars: ✭ 58 (+176.19%)

Mutual labels: automatic-speech-recognition

Fmix

Official implementation of 'FMix: Enhancing Mixed Sample Data Augmentation'

Stars: ✭ 252 (+1100%)

Mutual labels: augmentation

gretel-python-client

The Gretel Python Client allows you to interact with the Gretel REST API.

Stars: ✭ 28 (+33.33%)

Mutual labels: synthetic-data

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (+395.24%)

Mutual labels: automatic-speech-recognition

pygsound

Impulse response generation based on state-of-the-art geometric sound propagation engine.

Stars: ✭ 86 (+309.52%)

Mutual labels: room-impulse-response

soxan

Wav2Vec for speech recognition, classification, and audio classification

Stars: ✭ 113 (+438.1%)

Mutual labels: automatic-speech-recognition

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Stars: ✭ 143 (+580.95%)

Mutual labels: room-impulse-response

VisDA2020

VisDA2020: 4th Visual Domain Adaptation Challenge in ECCV'20

Stars: ✭ 53 (+152.38%)

Mutual labels: synthetic-data

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+1585.71%)

Mutual labels: automatic-speech-recognition

SDMetrics

Metrics to evaluate quality and efficacy of synthetic datasets.

Stars: ✭ 67 (+219.05%)

Mutual labels: synthetic-data

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+9828.57%)

Mutual labels: automatic-speech-recognition

multi-task-defocus-deblurring-dual-pixel-nimat

Reference github repository for the paper "Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning". We propose a single-image deblurring network that incorporates the two sub-aperture views into a multitask framework. Specifically, we show that jointly learning to predict the two DP views from a single …

Stars: ✭ 29 (+38.1%)

Mutual labels: synthetic-data

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (+485.71%)

Mutual labels: automatic-speech-recognition

discolight

discolight is a robust, flexible and infinitely hackable library for generating image augmentations ✨

Stars: ✭ 25 (+19.05%)

Mutual labels: augmentation

wave2vec-recognize-docker

Wave2vec 2.0 Recognize pipeline

Stars: ✭ 30 (+42.86%)

Mutual labels: automatic-speech-recognition

Torch Audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Stars: ✭ 164 (+680.95%)

Mutual labels: augmentation

Nlpaug

Data augmentation for NLP

Stars: ✭ 2,761 (+13047.62%)

Mutual labels: augmentation

textaugment

TextAugment: Text Augmentation Library

Stars: ✭ 280 (+1233.33%)

Mutual labels: augmentation

Robotics-Object-Pose-Estimation

A complete end-to-end demonstration in which we collect training data in Unity and use that data to train a deep neural network to predict the pose of a cube. This model is then deployed in a simulated robotic pick-and-place task.

Stars: ✭ 153 (+628.57%)

Mutual labels: synthetic-data

synth

The Declarative Data Generator

Stars: ✭ 958 (+4461.9%)

Mutual labels: synthetic-data

game-feature-learning

Code for paper "Cross-Domain Self-supervised Multi-task Feature Learning using Synthetic Imagery", Ren et al., CVPR'18

Stars: ✭ 68 (+223.81%)

Mutual labels: synthetic-data

hf-experiments

Experiments with Hugging Face 🔬 🤗

Stars: ✭ 37 (+76.19%)

Mutual labels: automatic-speech-recognition

volumentations

Library for 3D augmentations

Stars: ✭ 111 (+428.57%)

Mutual labels: augmentation

BadMedicine

Library and CLI for randomly generating medical data like you might get out of an Electronic Health Records (EHR) system

Stars: ✭ 18 (-14.29%)

Mutual labels: synthetic-data

genstar

Generation of Synthetic Populations Library

Stars: ✭ 17 (-19.05%)

Mutual labels: synthetic-data

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (+0%)

Mutual labels: automatic-speech-recognition

Imgaug

Image augmentation for machine learning experiments.

Stars: ✭ 12,107 (+57552.38%)

Mutual labels: augmentation

deep utils

An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!

Stars: ✭ 73 (+247.62%)

Mutual labels: augmentation

Three-Filters-to-Normal

Three-Filters-to-Normal: An Accurate and Ultrafast Surface Normal Estimator (RAL+ICRA'21)

Stars: ✭ 41 (+95.24%)

Mutual labels: synthetic-data

smogn

Synthetic Minority Over-Sampling Technique for Regression

Stars: ✭ 238 (+1033.33%)

Mutual labels: synthetic-data

table-evaluator

Evaluate real and synthetic datasets with each other

Stars: ✭ 44 (+109.52%)

Mutual labels: synthetic-data

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (+157.14%)

Mutual labels: automatic-speech-recognition

timber-ruby

🌲 Great Ruby logging made easy.

Stars: ✭ 155 (+638.1%)

Mutual labels: augmentation

ImageMethodReverb.jl

Room Acoustics Impulse Response Generator using the Randomized Image Method (RIM)

Stars: ✭ 23 (+9.52%)

Mutual labels: room-impulse-response

imgcrop

Simple image augmentation library focusing on random geometric cropping

Stars: ✭ 27 (+28.57%)

Mutual labels: augmentation

Automatic speech recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 2,751 (+13000%)

Mutual labels: automatic-speech-recognition

Clustering-Datasets

This repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.

Stars: ✭ 189 (+800%)

Mutual labels: synthetic-data

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (+4.76%)

Mutual labels: automatic-speech-recognition

zpy

Synthetic data for computer vision. An open source toolkit using Blender and Python.

Stars: ✭ 251 (+1095.24%)

Mutual labels: synthetic-data

Speech-Recognition

End-to-End Speech Recognition using Neural Networks.