670 open source projects that are alternatives to or similar to FAST-RIR

IR-GAN
Augmenting Room Impulse Response
Stars: ✭ 21 (-76.67%)
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+58.89%)
SDGym
Benchmarking synthetic data generation methods.
Stars: ✭ 177 (+96.67%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+36.67%)
DeepEcho
Synthetic Data Generation for mixed-type, multivariate time series.
Stars: ✭ 44 (-51.11%)
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-85.56%)
mtss-gan
MTSS-GAN: Multivariate Time Series Simulation with Generative Adversarial Networks (by @firmai)
Stars: ✭ 77 (-14.44%)
cram
cram is a computational room acoustics module to simulate and explore various acoustic properties of a modeled space
Stars: ✭ 23 (-74.44%)
Mutual labels:  acoustics, room-impulse-response
icassp2019-latex-template
ICASSP 2019 official LaTeX template
Stars: ✭ 21 (-76.67%)
Mutual labels:  speech, acoustics
tt-vae-gan
Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Stars: ✭ 37 (-58.89%)
ImageMethodReverb.jl
Room Acoustics Impulse Response Generator using the Randomized Image Method (RIM)
Stars: ✭ 23 (-74.44%)
Mutual labels:  acoustics, room-impulse-response
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (-14.44%)
Mutual labels:  speech
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+227.78%)
Mutual labels:  speech
VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Stars: ✭ 278 (+208.89%)
Mutual labels:  speech
Naver-AI-Hackathon-Speech
2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib
Stars: ✭ 26 (-71.11%)
Mutual labels:  speech
deep-blueberry
If you've always wanted to learn about deep-learning but don't know where to start, then you might have stumbled upon the right place!
Stars: ✭ 17 (-81.11%)
WearLock
Using Android Watch to unlock Android phone via acoustic tokens.
Stars: ✭ 12 (-86.67%)
Mutual labels:  acoustics
browser-apis
🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (-76.67%)
Mutual labels:  speech
Voice Gender
Gender recognition by voice and speech analysis
Stars: ✭ 248 (+175.56%)
Mutual labels:  speech
smogn
Synthetic Minority Over-Sampling Technique for Regression
Stars: ✭ 238 (+164.44%)
Mutual labels:  synthetic-data
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+168.89%)
Mutual labels:  speech
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+168.89%)
Mutual labels:  speech
publications-arruda-ijcnn-2019
Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night
Stars: ✭ 59 (-34.44%)
deep utils
An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!
Stars: ✭ 73 (-18.89%)
Mutual labels:  augmentation
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (-11.11%)
Mutual labels:  speech
Gcc Nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Stars: ✭ 231 (+156.67%)
Mutual labels:  speech
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.
Stars: ✭ 25 (-72.22%)
Mutual labels:  speech
eidos-audition
Collection of auditory models.
Stars: ✭ 25 (-72.22%)
Mutual labels:  speech
react-native-speech-bubble
💬 A speech bubble dialog component for React Native.
Stars: ✭ 50 (-44.44%)
Mutual labels:  speech
market risk gan tensorflow
Using Bidirectional Generative Adversarial Networks to estimate Value-at-Risk for Market Risk Management using TensorFlow.
Stars: ✭ 63 (-30%)
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-6.67%)
Mutual labels:  speech
cape
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (-67.78%)
Mutual labels:  speech
MultiGraphGAN
MultiGraphGAN for predicting multiple target graphs from a source graph using geometric deep learning.
Stars: ✭ 16 (-82.22%)
Source separation
Deep learning based speech source separation using Pytorch
Stars: ✭ 226 (+151.11%)
Mutual labels:  speech
lectures-all
Central repository for all lectures on deep learning at UPC ETSETB TelecomBCN.
Stars: ✭ 46 (-48.89%)
Mutual labels:  speech
ASR-Audio-Data-Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+98.89%)
Mutual labels:  speech
Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+172.22%)
Mutual labels:  speech
CycleGAN-gluon-mxnet
This repo attempts to reproduce Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (CycleGAN) as a Gluon reimplementation.
Stars: ✭ 31 (-65.56%)
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+172.22%)
Mutual labels:  speech
ventib
📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (-52.22%)
Mutual labels:  speech
Lhotse
Stars: ✭ 236 (+162.22%)
Mutual labels:  speech
Anime2Sketch
A sketch extractor for anime/illustration.
Stars: ✭ 1,623 (+1703.33%)
Setk
Tools for Speech Enhancement integrated with Kaldi
Stars: ✭ 227 (+152.22%)
Mutual labels:  speech
wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+127.78%)
Mutual labels:  speech
Volute
Raspberry Pi + Node.js = Speech Robot
Stars: ✭ 224 (+148.89%)
Mutual labels:  speech
txt2speech
Convert text to speech using Google Translate API
Stars: ✭ 38 (-57.78%)
Mutual labels:  speech
Speech Denoiser
A speech denoising LV2 plugin based on the RNNoise library
Stars: ✭ 220 (+144.44%)
Mutual labels:  speech
Speech Enhancement
Deep learning for audio denoising
Stars: ✭ 207 (+130%)
Mutual labels:  speech
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+293.33%)
Tts Cube
End-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (+136.67%)
Mutual labels:  speech
Neural Voice Cloning With Few Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Stars: ✭ 211 (+134.44%)
Mutual labels:  speech
SignDetect
This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.
Stars: ✭ 21 (-76.67%)
Mutual labels:  speech
precision-recall-distributions
Assessing Generative Models via Precision and Recall (official repository)
Stars: ✭ 80 (-11.11%)
obvi
A Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-40%)
Edgedict
Working online speech recognition based on RNN Transducer. (Trained model available in release.)
Stars: ✭ 205 (+127.78%)
Mutual labels:  speech
Timit
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Stars: ✭ 202 (+124.44%)
Mutual labels:  speech
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-41.11%)
Mutual labels:  speech
Esp8266sam
Speech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (+121.11%)
Mutual labels:  speech
Lingvo
Lingvo
Stars: ✭ 2,361 (+2523.33%)
Mutual labels:  speech
30-Days-GANs-Paper-Reading
30 Days GANs Paper Reading
Stars: ✭ 41 (-54.44%)