All Categories → Machine Learning → speech-recognition

Top 326 speech-recognition open source projects

Tensorflow-Keyword-Spotting
Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
A chronology of deep learning
Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Deep-learning-And-Paper
【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等
srvk-eesen-offline-transcriber
Top level code to transcribe English audio/video files into text/subtitles
speechless
Speech-to-text based on wav2letter built for transfer learning
Unity live caption
Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
syn-speech-samples
An application that demostrate the usage of Syn.Speech library for Speech Recognition
pytorch audio
audio processing module for pytorch:stft, istft
VoiceDictation
迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
speech-recognition-transfer-learning
Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Inimesed
An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
api
Speechly public API definitions and generated code
salutejs
SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Android-TTS-STT
One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Khronos
The open source intelligent personal assistant
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
titanium-speech
Use the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop.
KodiSharp
Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
praise
Do stuff with your voice in the browser.
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
241-300 of 326 speech-recognition projects