pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+1755.75%)

Mutual labels: speech-recognition

Deep-learning-And-Paper

【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等

Stars: ✭ 62 (-45.13%)

Mutual labels: speech-recognition

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (+42.48%)

Mutual labels: speech-recognition

KodiSharp

Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono

Stars: ✭ 22 (-80.53%)

Mutual labels: speech-recognition

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

Stars: ✭ 160 (+41.59%)

Mutual labels: speech-recognition

wavenet-classifier

Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks

Stars: ✭ 54 (-52.21%)

Mutual labels: speech-emotion-recognition

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (+38.05%)

Mutual labels: speech-recognition

praise

Do stuff with your voice in the browser.

Stars: ✭ 13 (-88.5%)

Mutual labels: speech-recognition

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (+33.63%)

Mutual labels: speech-recognition

Emotion and Polarity SO

An emotion classifier of text containing technical content from the SE domain

Stars: ✭ 74 (-34.51%)

Mutual labels: emotion-recognition

Openpose-based-GUI-for-Realtime-Pose-Estimate-and-Action-Recognition

GUI based on the python api of openpose in windows using cuda10 and cudnn7. Support body , hand, face keypoints estimation and data saving. Realtime gesture recognition is realized through two-layer neural network based on the skeleton collected from the gui.

Stars: ✭ 69 (-38.94%)

Mutual labels: emotion-recognition

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+73.45%)

Mutual labels: speech-recognition

Speech Recognition Neural Network

This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.

Stars: ✭ 148 (+30.97%)

Mutual labels: speech-recognition

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (-66.37%)

Mutual labels: speech-recognition

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Stars: ✭ 144 (+27.43%)

Mutual labels: speech-recognition

Resnet-Emotion-Recognition

Identifies emotion(s) from user facial expressions

Stars: ✭ 21 (-81.42%)

Mutual labels: emotion-recognition

Aimybox Android Assistant

Embeddable custom voice assistant for Android applications

Stars: ✭ 139 (+23.01%)

Mutual labels: speech-recognition

awesome-end2end-speech-recognition

💬 A list of End-to-End speech recognition, including papers, codes and other materials

Stars: ✭ 49 (-56.64%)

Mutual labels: speech-recognition

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (+19.47%)

Mutual labels: speech-recognition

converse

Conversational text Analysis using various NLP techniques

Stars: ✭ 147 (+30.09%)

Mutual labels: emotion-recognition

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (+16.81%)

Mutual labels: speech-recognition

picovoice

The end-to-end platform for building voice products at scale

Stars: ✭ 316 (+179.65%)

Mutual labels: speech-recognition

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+13.27%)

Mutual labels: speech-recognition

wave2vec-recognize-docker

Wave2vec 2.0 Recognize pipeline

Stars: ✭ 30 (-73.45%)

Mutual labels: automatic-speech-recognition

Pytorch Speech Commands

Speech commands recognition with PyTorch

Stars: ✭ 128 (+13.27%)

Mutual labels: speech-recognition

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (-20.35%)

Mutual labels: automatic-speech-recognition

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+9768.14%)

Mutual labels: speech-recognition

Unity live caption

Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!

Stars: ✭ 26 (-76.99%)

Mutual labels: speech-recognition

Keras Kaldi

Keras Interface for Kaldi ASR

Stars: ✭ 124 (+9.73%)

Mutual labels: speech-recognition

emotion-and-gender-classification

2 networks to recognition gender and emotion; face detection using Opencv or Mtcnn

Stars: ✭ 21 (-81.42%)

Mutual labels: emotion-recognition

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (+9.73%)

Mutual labels: speech-recognition

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-87.61%)

Mutual labels: speech-recognition

Project alias

Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.

Stars: ✭ 1,577 (+1295.58%)

Mutual labels: speech-recognition

cep

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

Stars: ✭ 140 (+23.89%)

Mutual labels: speech-recognition

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (+4.42%)

Mutual labels: speech-recognition

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-27.43%)

Mutual labels: speech-recognition

Rnn Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Stars: ✭ 114 (+0.88%)

Mutual labels: speech-recognition

specAugment

Tensor2tensor experiment with SpecAugment

Stars: ✭ 46 (-59.29%)

Mutual labels: speech-recognition

Ml Road

Machine Learning Resources, Practice and Research

Stars: ✭ 1,776 (+1471.68%)

Mutual labels: speech-recognition

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (-42.48%)

Mutual labels: speech-recognition

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+72.57%)

Mutual labels: speech-recognition

telltime

iOS application to tell the time in the British way 🇬🇧⏰

Stars: ✭ 49 (-56.64%)

Mutual labels: speech-recognition

Lingvo

Stars: ✭ 2,361 (+1989.38%)

Mutual labels: speech-recognition

torchsubband

Pytorch implementation of subband decomposition

Stars: ✭ 63 (-44.25%)