Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+283.12%)

Mutual labels: speech

Volute

Raspberry Pi + Nodejs = Speech Robot

Stars: ✭ 224 (+190.91%)

Mutual labels: speech

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (+79.22%)

Mutual labels: speech

Avpi

an open source voice command macro software

Stars: ✭ 130 (+68.83%)

Mutual labels: speech

Timit

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

Stars: ✭ 202 (+162.34%)

Mutual labels: speech

lectures-all

Central repository for all lectures on deep learning at UPC ETSETB TelecomBCN.

Stars: ✭ 46 (-40.26%)

Mutual labels: speech

Emotion Classification From Audio Files

Understanding emotions from audio files using neural networks and multiple datasets.

Stars: ✭ 189 (+145.45%)

Mutual labels: speech

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

Stars: ✭ 58 (-24.68%)

Mutual labels: speech

Siricontrol System

Control anything with Siri voice commands.

Stars: ✭ 180 (+133.77%)

Mutual labels: speech

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+218.18%)

Mutual labels: speech

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+2623.38%)

Mutual labels: speech

pytorch-pcen

PyTorch reimplementation of per-channel energy normalization for audio.

Stars: ✭ 80 (+3.9%)

Mutual labels: speech

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+2401.3%)

Mutual labels: speech

Setk

Tools for Speech Enhancement integrated with Kaldi

Stars: ✭ 227 (+194.81%)

Mutual labels: speech

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (+75.32%)

Mutual labels: speech

VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Stars: ✭ 278 (+261.04%)

Mutual labels: speech

Speech Enhancement

Deep learning for audio denoising

Stars: ✭ 207 (+168.83%)

Mutual labels: speech

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+66.23%)

Mutual labels: speech

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+14381.82%)

Mutual labels: speech

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+166.23%)

Mutual labels: speech

browser-apis

🦄 Cool & Fun Browser Web APIs 🥳

Stars: ✭ 21 (-72.73%)

Mutual labels: speech

Esp8266sam

Speech synthesis for ESP8266 using S.A.M. port

Stars: ✭ 199 (+158.44%)

Mutual labels: speech

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-31.17%)

Mutual labels: speech

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (+148.05%)

Mutual labels: speech

Voice Gender

Gender recognition by voice and speech analysis

Stars: ✭ 248 (+222.08%)

Mutual labels: speech

Depression Detect

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Stars: ✭ 187 (+142.86%)

Mutual labels: speech

ventib

📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.

Stars: ✭ 43 (-44.16%)

Mutual labels: speech

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (+136.36%)

Mutual labels: speech

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+214.29%)

Mutual labels: speech

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+127.27%)

Mutual labels: speech

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.

Stars: ✭ 25 (-67.53%)

Mutual labels: speech

Chatbot Watson Android

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Stars: ✭ 169 (+119.48%)

Mutual labels: speech

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+214.29%)

Mutual labels: speech

Tacotron asr

Speech Recognition Using Tacotron