octopusOn-device speech-to-index engine powered by deep learning.
iOSProjectsIt's project that contains different applications developed with Swift 5.7 👨💻👩🏼💻🧑🏿💻
picovoiceThe end-to-end platform for building voice products at scale
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
cepCEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
masr中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
multilingual kwsFew-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
CaptionThis"Caption This" is an iOS app that adds real-time captions to videos for Instagram Stories
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
leopardOn-device speech-to-text engine powered by deep learning
obviA Polymer 3+ webcomponent / button for doing speech recognition
anycontrolVoice control for your websites and applications
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
megsA merged version of multiple open-source German speech datasets.
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
idear🎙️ Handsfree Audio Development Interface