pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+4892.86%)

Mutual labels: speech, asr

Edgedict

Working online speech recognition based on RNN Transducer (trained model available in the releases).

Stars: ✭ 205 (+388.1%)

Mutual labels: speech, asr

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+316.67%)

Mutual labels: speech, asr

Naver-AI-Hackathon-Speech

2019 Clova AI Hackathon: Speech - Rank 12 / Team Kai.Lib

Stars: ✭ 26 (-38.1%)

Mutual labels: speech, seq2seq

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.

Stars: ✭ 25 (-40.48%)

Mutual labels: speech, ctc

avsr-tf1

Audio-Visual Speech Recognition using Sequence to Sequence Models

Stars: ✭ 76 (+80.95%)

Mutual labels: seq2seq, asr

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+1190.48%)

Mutual labels: asr, ctc

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-50%)

Mutual labels: speech, asr

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (+985.71%)

Mutual labels: seq2seq, asr

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+166.67%)

Mutual labels: asr, ctc

sentence2vec

Deep sentence embedding using Sequence to Sequence learning

Stars: ✭ 23 (-45.24%)

Mutual labels: torch, seq2seq

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+276.19%)

Mutual labels: speech, seq2seq

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (+192.86%)

Mutual labels: speech, asr

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Stars: ✭ 205 (+388.1%)

Mutual labels: speech, asr

speech-transformer

Transformer implementation specialized in speech recognition tasks using PyTorch.

Stars: ✭ 40 (-4.76%)

Mutual labels: speech, asr

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (+23.81%)

Mutual labels: speech, asr

deep-molecular-optimization

Molecular optimization by capturing chemist’s intuition using the Seq2Seq with attention and the Transformer

Stars: ✭ 60 (+42.86%)

Mutual labels: seq2seq

speech-recognition

SDKs and docs for Skit's speech-to-text service

Stars: ✭ 20 (-52.38%)

Mutual labels: asr

pytorch-transformer-chatbot

A simple chitchat chatbot using the Transformer API introduced in PyTorch v1.2

Stars: ✭ 44 (+4.76%)

Mutual labels: seq2seq

torch-lrcn

An implementation of the LRCN in Torch

Stars: ✭ 85 (+102.38%)

Mutual labels: torch

vqa-soft

Accompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" CVPR 2017 VQA workshop paper.

Stars: ✭ 14 (-66.67%)

Mutual labels: torch

tensorsem

Structural Equation Modeling using Torch

Stars: ✭ 36 (-14.29%)

Mutual labels: torch

transformer

Neutron: A PyTorch-based implementation of Transformer and its variants.

Stars: ✭ 60 (+42.86%)

Mutual labels: seq2seq

fade

A Simulation Framework for Auditory Discrimination Experiments

Stars: ✭ 12 (-71.43%)

Mutual labels: speech

LIUM

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-33.33%)

Mutual labels: speech

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-54.76%)

Mutual labels: speech

kaldi-alligner

Scripts to align a given wave file to its transcription using models trained with Kaldi

Stars: ✭ 24 (-42.86%)

Mutual labels: asr

ai-visual-storytelling-seq2seq

Implementation of seq2seq model for Visual Storytelling Challenge (VIST) http://visionandlanguage.net/VIST/index.html

Stars: ✭ 50 (+19.05%)

Mutual labels: seq2seq

kosr

Korean speech recognition based on the Transformer

Stars: ✭ 25 (-40.48%)

Mutual labels: asr

bouncer

An application to cycle (bounce) all nodes in a coordinated fashion in an AWS ASG or set of related ASGs

Stars: ✭ 123 (+192.86%)

Mutual labels: asg

nabaztag-php

A simple PHP implementation of a Nabaztag server

Stars: ✭ 14 (-66.67%)

Mutual labels: speech

Neural Conversation Models

TensorFlow-based Neural Conversation Models

Stars: ✭ 29 (-30.95%)

Mutual labels: seq2seq

DLCV2018SPRING

Deep Learning for Computer Vision (CommE 5052) in NTU

Stars: ✭ 38 (-9.52%)

Mutual labels: seq2seq

HTK

The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.

Stars: ✭ 23 (-45.24%)

Mutual labels: speech

SER-datasets

A collection of datasets for the purpose of emotion recognition/detection in speech.

Stars: ✭ 74 (+76.19%)

Mutual labels: speech

neuralBlack

A multi-class brain tumor classifier using a convolutional neural network, with 99% accuracy achieved by applying transfer learning using Python and the PyTorch deep learning framework

Stars: ✭ 36 (-14.29%)

Mutual labels: torch

Base-On-Relation-Method-Extract-News-DA-RNN-Model-For-Stock-Prediction--Pytorch

A dual-stage attention mechanism model based on a relational news extraction method, for stock prediction

Stars: ✭ 33 (-21.43%)

Mutual labels: seq2seq

Embedding

Summary of embedding model code and study notes

Stars: ✭ 25 (-40.48%)

Mutual labels: seq2seq

dts

A Keras library for multi-step time-series forecasting.

Stars: ✭ 130 (+209.52%)

Mutual labels: seq2seq

1-60 of 593 similar projects