Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

Stars: ✭ 37 (-58.89%)

Mutual labels: speech, generative-adversarial-network

ImageMethodReverb.jl

Room Acoustics Impulse Response Generator using the Randomized Image Method (RIM)

Stars: ✭ 23 (-74.44%)

Mutual labels: acoustics, room-impulse-response

NBSS

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Stars: ✭ 77 (-14.44%)

Mutual labels: speech

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+227.78%)

Mutual labels: speech

VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Stars: ✭ 278 (+208.89%)

Mutual labels: speech

Naver-AI-Hackathon-Speech

2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib

Stars: ✭ 26 (-71.11%)

Mutual labels: speech

deep-blueberry

If you've always wanted to learn about deep-learning but don't know where to start, then you might have stumbled upon the right place!

Stars: ✭ 17 (-81.11%)

Mutual labels: generative-adversarial-network

WearLock

Using Android Watch to unlock Android phone via acoustic tokens.

Stars: ✭ 12 (-86.67%)

Mutual labels: acoustics

browser-apis

🦄 Cool & Fun Browser Web APIs 🥳

Stars: ✭ 21 (-76.67%)

Mutual labels: speech

Voice Gender

Gender recognition by voice and speech analysis

Stars: ✭ 248 (+175.56%)

Mutual labels: speech

smogn

Synthetic Minority Over-Sampling Technique for Regression

Stars: ✭ 238 (+164.44%)

Mutual labels: synthetic-data

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+168.89%)

Mutual labels: speech

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+168.89%)

Mutual labels: speech

publications-arruda-ijcnn-2019

Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night

Stars: ✭ 59 (-34.44%)

Mutual labels: generative-adversarial-network

deep utils

An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!

Stars: ✭ 73 (-18.89%)

Mutual labels: augmentation

pytorch-pcen

PyTorch reimplementation of per-channel energy normalization for audio.

Stars: ✭ 80 (-11.11%)

Mutual labels: speech

Gcc Nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

Stars: ✭ 231 (+156.67%)

Mutual labels: speech

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.

Stars: ✭ 25 (-72.22%)

Mutual labels: speech

eidos-audition

Collection of auditory models.

Stars: ✭ 25 (-72.22%)

Mutual labels: speech

react-native-speech-bubble

💬 A speech bubble dialog component for React Native.

Stars: ✭ 50 (-44.44%)

Mutual labels: speech

market risk gan tensorflow

Using Bidirectional Generative Adversarial Networks to estimate Value-at-Risk for Market Risk Management using TensorFlow.

Stars: ✭ 63 (-30%)

Mutual labels: generative-adversarial-network

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-6.67%)

Mutual labels: speech

cape

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (-67.78%)

Mutual labels: speech

MultiGraphGAN

MultiGraphGAN for predicting multiple target graphs from a source graph using geometric deep learning.

Stars: ✭ 16 (-82.22%)

Mutual labels: generative-adversarial-network

Source separation

Deep learning based speech source separation using Pytorch

Stars: ✭ 226 (+151.11%)

Mutual labels: speech

lectures-all

Central repository for all lectures on deep learning at UPC ETSETB TelecomBCN.

Stars: ✭ 46 (-48.89%)

Mutual labels: speech

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+98.89%)

Mutual labels: speech

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+172.22%)

Mutual labels: speech

CycleGAN-gluon-mxnet

this repo attemps to reproduce Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks(CycleGAN) use gluon reimplementation

Stars: ✭ 31 (-65.56%)

Mutual labels: generative-adversarial-network

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+172.22%)

Mutual labels: speech

ventib

📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.

Stars: ✭ 43 (-52.22%)

Mutual labels: speech

Lhotse

Stars: ✭ 236 (+162.22%)

Mutual labels: speech

Anime2Sketch

A sketch extractor for anime/illustration.

Stars: ✭ 1,623 (+1703.33%)

Mutual labels: generative-adversarial-network

Setk

Tools for Speech Enhancement integrated with Kaldi

Stars: ✭ 227 (+152.22%)

Mutual labels: speech

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+127.78%)

Mutual labels: speech

Volute

Raspberry Pi + Nodejs = Speech Robot

Stars: ✭ 224 (+148.89%)

Mutual labels: speech

txt2speech

Convert text to speech using Google Translate API

Stars: ✭ 38 (-57.78%)

Mutual labels: speech

Speech Denoiser

A speech denoise lv2 plugin based on RNNoise library

Stars: ✭ 220 (+144.44%)

Mutual labels: speech

Speech Enhancement

Deep learning for audio denoising

Stars: ✭ 207 (+130%)

Mutual labels: speech

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+293.33%)

Mutual labels: automatic-speech-recognition

Tts Cube

End-2-end speech synthesis with recurrent neural networks

Stars: ✭ 213 (+136.67%)

Mutual labels: speech

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (+134.44%)

Mutual labels: speech

SignDetect

This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.

Stars: ✭ 21 (-76.67%)

Mutual labels: speech

precision-recall-distributions

Assessing Generative Models via Precision and Recall (official repository)

Stars: ✭ 80 (-11.11%)

Mutual labels: generative-adversarial-network

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (-40%)

Mutual labels: automatic-speech-recognition

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )