A speech-to-text bot for Discord with music commands and more, built with NodeJS. Ideal for controlling your Discord server using voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (-82.67%)

Mutual labels: speech

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-18.32%)

Mutual labels: speech

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+2977.23%)

Mutual labels: speech

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (-41.58%)

Mutual labels: speech

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-43.56%)

Mutual labels: speech

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+213.37%)

Mutual labels: speech

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+853.47%)

Mutual labels: speech

Python Speech recognition

A simple example of using the Baidu speech recognition API with Python.

Stars: ✭ 106 (-47.52%)

Mutual labels: speech

Siricontrol System

Control anything with Siri voice commands.

Stars: ✭ 180 (-10.89%)

Mutual labels: speech

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-50.99%)

Mutual labels: speech

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-31.68%)

Mutual labels: speech

Gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API

Stars: ✭ 1,303 (+545.05%)

Mutual labels: speech

Emotion Classification From Audio Files

Understanding emotions from audio files using neural networks and multiple datasets.

Stars: ✭ 189 (-6.44%)

Mutual labels: speech

Audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Stars: ✭ 1,262 (+524.75%)

Mutual labels: speech

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-33.17%)

Mutual labels: speech

Tts

Tools to convert text to speech 📚💬

Stars: ✭ 84 (-58.42%)

Mutual labels: speech

Deep speaker Speaker recognition system

Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)

Stars: ✭ 174 (-13.86%)

Mutual labels: speech

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (-65.84%)

Mutual labels: speech

Avpi

An open-source voice command macro program.

Stars: ✭ 130 (-35.64%)

Mutual labels: speech

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-68.32%)

Mutual labels: speech

Lingvo

Stars: ✭ 2,361 (+1068.81%)

Mutual labels: speech

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-71.78%)

Mutual labels: speech

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-36.63%)

Mutual labels: speech

Stl

The ITU-T Software Tool Library (G.191)

Stars: ✭ 44 (-78.22%)

Mutual labels: speech

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+938.12%)

Mutual labels: speech

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-80.2%)

Mutual labels: speech

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+5420.3%)

Mutual labels: speech

Wsay

Windows "say"