This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (-20.35%)

Mutual labels: automatic-speech-recognition

CCAligner

🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

Stars: ✭ 131 (+15.93%)

Mutual labels: speech-recognition

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+9768.14%)

Mutual labels: speech-recognition

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 6,026 (+5232.74%)

Mutual labels: speech-recognition

Unity live caption

Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!

Stars: ✭ 26 (-76.99%)

Mutual labels: speech-recognition

voce-browser

Voice Controlled Chromium Web Browser

Stars: ✭ 34 (-69.91%)

Mutual labels: speech-recognition

Keras Kaldi

Keras Interface for Kaldi ASR

Stars: ✭ 124 (+9.73%)

Mutual labels: speech-recognition

pyjsgf

JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.

Stars: ✭ 40 (-64.6%)

Mutual labels: speech-recognition

emotion-and-gender-classification

2 networks to recognition gender and emotion; face detection using Opencv or Mtcnn

Stars: ✭ 21 (-81.42%)

Mutual labels: emotion-recognition

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (-55.75%)

Mutual labels: speech-recognition

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (+9.73%)

Mutual labels: speech-recognition

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-83.19%)

Mutual labels: speech-recognition

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-87.61%)

Mutual labels: speech-recognition

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-69.03%)

Mutual labels: speech-recognition

Project alias

Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.

Stars: ✭ 1,577 (+1295.58%)

Mutual labels: speech-recognition

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-53.98%)

Mutual labels: speech-recognition

cep

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

Stars: ✭ 140 (+23.89%)

Mutual labels: speech-recognition

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+98.23%)

Mutual labels: speech-recognition

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (+4.42%)

Mutual labels: speech-recognition

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (-67.26%)

Mutual labels: speech-recognition

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-27.43%)

Mutual labels: speech-recognition

automatic speech recognition

Vietnamese Automatic Speech Recognition

Stars: ✭ 58 (-48.67%)

Mutual labels: automatic-speech-recognition

Rnn Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Stars: ✭ 114 (+0.88%)

Mutual labels: speech-recognition

IR-GAN

Augmenting Room Impulse Response

Stars: ✭ 21 (-81.42%)

Mutual labels: automatic-speech-recognition

specAugment

Tensor2tensor experiment with SpecAugment

Stars: ✭ 46 (-59.29%)

Mutual labels: speech-recognition

Modality-Transferable-MER

Modality-Transferable-MER, multimodal emotion recognition model with zero-shot and few-shot abilities.

Stars: ✭ 36 (-68.14%)

Mutual labels: emotion-recognition

Ml Road

Machine Learning Resources, Practice and Research

Stars: ✭ 1,776 (+1471.68%)

Mutual labels: speech-recognition

OpenVINO-EmotionRecognition

OpenVINO+NCS2/NCS+MutiModel(FaceDetection, EmotionRecognition)+MultiStick+MultiProcess+MultiThread+USB Camera/PiCamera. RaspberryPi 3 compatible. Async.

Stars: ✭ 51 (-54.87%)

Mutual labels: emotion-recognition

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (-42.48%)

Mutual labels: speech-recognition

Deepspeechrecognition

A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型

Stars: ✭ 1,421 (+1157.52%)

Mutual labels: speech-recognition

SpeechEmoRec

Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching