A speech-to-text bot for Discord with music commands and more, built with Node.js. Ideal for controlling your Discord server with voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (-79.89%)

Mutual labels: speech

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (-24.14%)

Mutual labels: speech

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+257.47%)

Mutual labels: speech

Wikipron

Massively multilingual pronunciation mining

Stars: ✭ 99 (-43.1%)

Mutual labels: speech

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+600.57%)

Mutual labels: speech

Specaugment

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Stars: ✭ 408 (+134.48%)

Mutual labels: speech

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (-28.74%)

Mutual labels: speech

Sound Source Localization Algorithm doa estimation

Some traditional algorithms used for DOA estimation in sound source localization of speech signals

Stars: ✭ 58 (-66.67%)

Mutual labels: speech

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-20.11%)

Mutual labels: speech

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (+484.48%)

Mutual labels: speech

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-33.91%)

Mutual labels: speech

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-82.18%)

Mutual labels: speech

Aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+1016.09%)

Mutual labels: speech

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+279.89%)

Mutual labels: speech

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+750%)

Mutual labels: speech

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+205.75%)

Mutual labels: speech

Voc

A physical model of the human vocal tract using literate programming, based on Pink Trombone.

Stars: ✭ 129 (-25.86%)

Mutual labels: speech

Xr3player

🎧 🎼 Advanced JavaFX Media Player

Stars: ✭ 472 (+171.26%)

Mutual labels: speech

Wavenet Enhancement

Speech Enhancement using Bayesian WaveNet

Stars: ✭ 86 (-50.57%)

Mutual labels: speech

Tts

Tools to convert text to speech 📚💬

Stars: ✭ 84 (-51.72%)

Mutual labels: speech

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+134.48%)

Mutual labels: speech

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+6308.62%)

Mutual labels: speech

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (-60.34%)

Mutual labels: speech

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-20.69%)

Mutual labels: speech

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-63.22%)

Mutual labels: speech

Code Switching Papers

A curated list of research papers and resources on code-switching

Stars: ✭ 122 (-29.89%)

Mutual labels: speech

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-67.24%)

Mutual labels: speech

Tts Papers

🐸 collection of TTS papers

Stars: ✭ 160 (-8.05%)

Mutual labels: speech

Stl

The ITU-T Software Tool Library (G.191)

Stars: ✭ 44 (-74.71%)

Mutual labels: speech

Speech And Text Unity Ios Android

Speech to text in Unity iOS using native Speech Recognition

Stars: ✭ 117 (-32.76%)

Mutual labels: speech

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-77.01%)

Mutual labels: speech

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-22.41%)

Mutual labels: speech

Wsay

Windows "say"

Stars: ✭ 36 (-79.31%)

Mutual labels: speech

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-34.48%)

Mutual labels: speech

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+334.48%)

Mutual labels: speech

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+1105.17%)

Mutual labels: speech

Praat

Praat: Doing Phonetics By Computer

Stars: ✭ 675 (+287.93%)

Mutual labels: speech

Python Speech recognition

A simple example of using the Baidu speech recognition API with Python.

Stars: ✭ 106 (-39.08%)

Mutual labels: speech

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+263.79%)

Mutual labels: speech

Avpi

Open-source voice command macro software

Stars: ✭ 130 (-25.29%)

Mutual labels: speech

Nodejs Speech

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

Stars: ✭ 545 (+213.22%)

Mutual labels: speech

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-43.1%)

Mutual labels: speech

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+196.55%)

Mutual labels: speech

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+1006.9%)

Mutual labels: speech

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+181.61%)

Mutual labels: speech

Gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API

Stars: ✭ 1,303 (+648.85%)

Mutual labels: speech

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+151.15%)

Mutual labels: speech

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities