CNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (-81.72%)

Mutual labels: speech

Sound Source Localization Algorithm doa estimation

Some traditional algorithms used for sound source localization and DOA (direction of arrival) estimation of speech signals

Stars: ✭ 58 (-96.99%)

Mutual labels: speech

Espeak

eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.

Stars: ✭ 339 (-82.4%)

Mutual labels: speech-synthesis

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (-21.65%)

Mutual labels: speech-synthesis

Ios 10 Sampler

Code examples for new APIs of iOS 10.

Stars: ✭ 3,341 (+73.47%)

Mutual labels: speech

Pink Trombone

A programmable version of Neil Thapen's Pink Trombone

Stars: ✭ 54 (-97.2%)

Mutual labels: speech-synthesis

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-83.13%)

Mutual labels: speech-synthesis

Xva Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Stars: ✭ 136 (-92.94%)

Mutual labels: speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-83.8%)

Mutual labels: speech-synthesis

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-97.3%)

Mutual labels: speech-synthesis

Numpy Ml

Machine learning, in numpy

Stars: ✭ 11,100 (+476.32%)

Mutual labels: wavenet

Soloud

Free, easy, portable audio engine for games

Stars: ✭ 1,048 (-45.59%)

Mutual labels: speech

Sam

Software Automatic Mouth - Tiny Speech Synthesizer

Stars: ✭ 667 (-65.37%)

Mutual labels: speech-synthesis

Sednn

Deep learning-based speech enhancement using Keras or PyTorch, made easy to use

Stars: ✭ 288 (-85.05%)

Mutual labels: speech

Reconstructing faces from voices

An example of the paper "reconstructing faces from voices"

Stars: ✭ 127 (-93.41%)

Mutual labels: speech

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-85.51%)

Mutual labels: speech-synthesis

Tacotron2

PyTorch Tacotron 2 (https://arxiv.org/pdf/1712.05884.pdf)

Stars: ✭ 46 (-97.61%)

Mutual labels: wavenet

Audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Stars: ✭ 1,262 (-34.48%)

Mutual labels: speech

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (-65.68%)

Mutual labels: speech

Wave U Net For Speech Enhancement

A PyTorch implementation of Wave-U-Net, adapted for speech enhancement.

Stars: ✭ 106 (-94.5%)

Mutual labels: speech-processing

Awesome Speech Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Stars: ✭ 257 (-86.66%)

Mutual labels: speech-processing

Formant Analyzer

iOS application for finding formants in spoken sounds

Stars: ✭ 43 (-97.77%)

Mutual labels: speech-processing

Speech Emotion Analyzer

The neural network model can detect five different male/female emotions from speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (-67.13%)

Mutual labels: speech

SpeechTransProgress

Tracking the progress in end-to-end speech translation

Stars: ✭ 139 (-92.78%)

Mutual labels: speech-processing

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (-47.2%)

Mutual labels: speech

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (-23.21%)

Mutual labels: speech

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (-93.87%)

Mutual labels: speech

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (-34.68%)

Mutual labels: speech

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (-67.71%)

Mutual labels: speech

voice-conversion

A tutorial implementation of voice conversion using PyTorch

Stars: ✭ 26 (-98.65%)

Mutual labels: speech-synthesis

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-97.92%)

Mutual labels: speech

EmotionalConversionStarGAN

This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".

Stars: ✭ 92 (-95.22%)

Mutual labels: speech-synthesis

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+478.97%)

Mutual labels: speech

flite-go

Go bindings for Flite (festival-lite)

Stars: ✭ 14 (-99.27%)

Mutual labels: speech

Vq Vae Wavenet

TensorFlow implementation of VQ-VAE with WaveNet decoder, based on https://arxiv.org/abs/1711.00937 and https://arxiv.org/abs/1901.08810