pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+1916.35%)

Mutual labels: speech-recognition

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (+493.27%)

Mutual labels: speech-recognition

Speech-Recognition

End-to-end Automatic Speech Recognition for Mandarin and English in Tensorflow

Stars: ✭ 21 (-79.81%)

Mutual labels: speech-recognition

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+1322.12%)

Mutual labels: speech-recognition

syn-speech-samples

An application that demonstrates the usage of the Syn.Speech library for speech recognition

Stars: ✭ 24 (-76.92%)

Mutual labels: speech-recognition

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+421.15%)

Mutual labels: speech-recognition

Speech recognition

Chinese speech recognition

Stars: ✭ 534 (+413.46%)

Mutual labels: speech-recognition

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (+408.65%)

Mutual labels: speech-recognition

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

Stars: ✭ 58 (-44.23%)

Mutual labels: speech-recognition

Kaldiio

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (+53.85%)

Mutual labels: speech-recognition

awesome-end2end-speech-recognition

💬 A list of end-to-end speech recognition resources, including papers, code, and other materials

Stars: ✭ 49 (-52.88%)

Mutual labels: speech-recognition

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

Stars: ✭ 160 (+53.85%)

Mutual labels: speech-recognition

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (+362.5%)

Mutual labels: speech-recognition

VoiceDictation

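Xunfei (iFLYTEK) voice dictation WebAPI - converts speech (≤60 seconds) into the corresponding text so that machines can "understand" human language, in effect giving the machine "ears" so that it is able to "listen".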

Stars: ✭ 36 (-65.38%)

Mutual labels: speech-recognition

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+135.58%)

Mutual labels: speech-to-text

Rnnt Speech Recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

Stars: ✭ 158 (+51.92%)

Mutual labels: speech-recognition

Rhasspy

Offline private voice assistant for many human languages

Stars: ✭ 458 (+340.38%)

Mutual labels: speech-recognition

Go Astibob

Golang framework to build an AI that can understand and speak back to you, and everything else you want

Stars: ✭ 222 (+113.46%)

Mutual labels: speech-to-text

iOSProjects

A project that contains different applications developed with Swift 5.7 👨‍💻👩🏼‍💻🧑🏿‍💻

Stars: ✭ 122 (+17.31%)

Mutual labels: speech-recognition

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (+50%)

Mutual labels: speech-recognition

Uspeech

Speech recognition toolkit for the Arduino

Stars: ✭ 448 (+330.77%)

Mutual labels: speech-recognition

Speaker adapted tts

Making a TTS model with 1 minute of speech samples within 10 minutes

Stars: ✭ 183 (+75.96%)

Mutual labels: speech-to-text

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (-5.77%)

Mutual labels: speech-recognition

houndify-sdk-go

The official Houndify SDK for Go

Stars: ✭ 23 (-77.88%)

Mutual labels: speech-recognition

Nlp Models Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Stars: ✭ 1,603 (+1441.35%)

Mutual labels: speech-to-text

Ai Study

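A comprehensive collection of AI study materials, covering machine learning fundamentals (ML), deep learning fundamentals (DL), computer vision (CV), natural language processing (NLP), recommender systems, speech recognition, graph neural networks, and interview questions for algorithm engineers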

Stars: ✭ 93 (-10.58%)

Mutual labels: speech-recognition

Casr Demo

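A Flask-based web demo system for Chinese automatic speech recognition, including speech recognition, speech synthesis, and voiceprint-based speaker recognition.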

Stars: ✭ 76 (-26.92%)

Mutual labels: speech-to-text

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-19.23%)

Mutual labels: speech-recognition

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-38.46%)

Mutual labels: speech-to-text

Cross vc

Cross-lingual Voice Conversion

Stars: ✭ 91 (-12.5%)

Mutual labels: speech-recognition

Botium Speech Processing

Stars: ✭ 908 (+773.08%)

Mutual labels: speech-to-text

Speech Transformer Tf2.0

Transformer for ASR systems (via TensorFlow 2.0)

Stars: ✭ 90 (-13.46%)

Mutual labels: speech-recognition

Nonocaptcha

An asynchronous Python library to automate solving ReCAPTCHA v2 using audio

Stars: ✭ 744 (+615.38%)

Mutual labels: speech-to-text

Clovacall

ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)

Stars: ✭ 151 (+45.19%)

Mutual labels: speech-recognition

Specaugment

An implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Stars: ✭ 408 (+292.31%)

Mutual labels: speech-recognition

Open stt

Open STT

Stars: ✭ 584 (+461.54%)

Mutual labels: speech-to-text

Speech Emotion Recognition

Detecting emotions using MFCC features of human speech using Deep Learning

Stars: ✭ 89 (-14.42%)

Mutual labels: speech-recognition

Autoedit 2

Fast text-based video editing: a Node Electron OS X desktop app with a Backbone front end.

Stars: ✭ 343 (+229.81%)

Mutual labels: speech-to-text

picovoice

The end-to-end platform for building voice products at scale

Stars: ✭ 316 (+203.85%)

Mutual labels: speech-recognition

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+292.31%)

Mutual labels: speech-recognition

Zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Stars: ✭ 248 (+138.46%)

Mutual labels: speech-recognition

TinyCog

Small Robot, Toy Robot platform

Stars: ✭ 29 (-72.12%)

Mutual labels: speech-recognition

Swiftspeech

A speech recognition framework designed for SwiftUI.

Stars: ✭ 149 (+43.27%)

Mutual labels: speech-recognition

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

Stars: ✭ 398 (+282.69%)

Mutual labels: speech-recognition

LipNet

Automated lip reading from real-time videos, in TensorFlow and Python

Stars: ✭ 113 (+8.65%)

Mutual labels: lip-reading

Speech Recognition Neural Network

This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.

Stars: ✭ 148 (+42.31%)

Mutual labels: speech-recognition

Free Spoken Digit Dataset

A free audio dataset of spoken digits. Think MNIST for audio.

Stars: ✭ 396 (+280.77%)

Mutual labels: speech-recognition
