THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Stars: ✭ 31 (-86.34%)

Mutual labels: kaldi

Deep speaker Speaker recognition system

Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)

Stars: ✭ 174 (-23.35%)

Mutual labels: speech

Kaldi Io

c++ Kaldi IO lib (static and dynamic).

Stars: ✭ 22 (-90.31%)

Mutual labels: kaldi

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-56.39%)

Mutual labels: speech

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (+31.28%)

Mutual labels: speech

Sednn

deep learning based speech enhancement using keras or pytorch, make it easy to use

Stars: ✭ 288 (+26.87%)

Mutual labels: speech

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+2638.33%)

Mutual labels: speech

Code Switching Papers

A curated list of research papers and resources on code-switching

Stars: ✭ 122 (-46.26%)

Mutual labels: speech

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+191.19%)

Mutual labels: speech

Aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+755.51%)

Mutual labels: speech

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (-56.83%)

Mutual labels: kaldi

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (+22.03%)

Mutual labels: kaldi

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+174.01%)

Mutual labels: speech

Speech And Text Unity Ios Android

Speed to text in Unity iOS use Native Speech Recognition

Stars: ✭ 117 (-48.46%)

Mutual labels: speech

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+134.36%)

Mutual labels: speech

Volute

Raspberry Pi + Nodejs = Speech Robot

Stars: ✭ 224 (-1.32%)

Mutual labels: speech

Montreal Forced Aligner

Command line utility for forced alignment using Kaldi

Stars: ✭ 490 (+115.86%)

Mutual labels: kaldi

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-49.34%)

Mutual labels: speech

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+115.86%)

Mutual labels: speech

Ctc pytorch

CTC end -to-end ASR for timit and 863 corpus.

Stars: ✭ 161 (-29.07%)

Mutual labels: kaldi

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+92.51%)

Mutual labels: speech

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-51.1%)

Mutual labels: speech

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (-15.86%)

Mutual labels: speech

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (+69.16%)

Mutual labels: speech

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+551.54%)

Mutual labels: speech

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+34.36%)

Mutual labels: speech

Tts Papers

🐸 collection of TTS papers

Stars: ✭ 160 (-29.52%)

Mutual labels: speech

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+59.47%)

Mutual labels: speech

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+497.8%)

Mutual labels: kaldi

Asr theory

语音识别理论，论文和PPT

Stars: ✭ 344 (+51.54%)

Mutual labels: kaldi

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (-7.05%)

Mutual labels: speech

Ios 10 Sampler

Code examples for new APIs of iOS 10.

Stars: ✭ 3,341 (+1371.81%)

Mutual labels: speech

Pytorch Kaldi Neural Speaker Embeddings

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

Stars: ✭ 99 (-56.39%)

Mutual labels: kaldi

Css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Stars: ✭ 302 (+33.04%)

Mutual labels: speech

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (-31.28%)

Mutual labels: kaldi

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+30.84%)

Mutual labels: speech

Wikipron

Massively multilingual pronunciation mining

Stars: ✭ 99 (-56.39%)

Mutual labels: speech

React Transcript Editor

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

Stars: ✭ 285 (+25.55%)

Mutual labels: kaldi

Depression Detect

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Stars: ✭ 187 (-17.62%)

Mutual labels: speech

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (+19.38%)

Mutual labels: kaldi

Eend

End-to-End Neural Diarization

Stars: ✭ 153 (-32.6%)

Mutual labels: kaldi

Wavenet Enhancement

Speech Enhancement using Bayesian WaveNet

Stars: ✭ 86 (-62.11%)

Mutual labels: speech

Plda

An LDA/PLDA estimator using KALDI in python for speaker verification tasks

Stars: ✭ 85 (-62.56%)

Mutual labels: kaldi

Gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API

Stars: ✭ 1,303 (+474.01%)

Mutual labels: speech

Source separation

Deep learning based speech source separation using Pytorch

Stars: ✭ 226 (-0.44%)

Mutual labels: speech

61-120 of 227 similar projects

‹

›