timit-preprocessorExtract mfcc vectors and phones from TIMIT dataset
Stars: ✭ 14 (-36.36%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+1509.09%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (+9.09%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (+140.91%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+9527.27%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+327.27%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (+59.09%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (+18.18%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+1972.73%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+72.73%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+727.27%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+1000%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-22.73%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+16650%)
pytorch audioaudio processing module for pytorch:stft, istft
Stars: ✭ 33 (+50%)
CidlibThe CIDLib general purpose C++ development environment
Stars: ✭ 179 (+713.64%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+304.55%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+786.36%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-4.55%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+581.82%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+763.64%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-9.09%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+759.09%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-4.55%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+372.73%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (+36.36%)
Deepspeech ServerA testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+700%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (+40.91%)
iOSProjectsIt's project that contains different applications developed with Swift 5.7 👨💻👩🏼💻🧑🏿💻
Stars: ✭ 122 (+454.55%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+1336.36%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+677.27%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-4.55%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+650%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+831.82%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (+63.64%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+127.27%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (+213.64%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+631.82%)
kaldi-python-ioA python IO interface for data accessing in kaldi
Stars: ✭ 39 (+77.27%)
Rnnt Speech RecognitionEnd-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Stars: ✭ 158 (+618.18%)
apiSpeechly public API definitions and generated code
Stars: ✭ 15 (-31.82%)
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (+586.36%)
cepCEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Stars: ✭ 140 (+536.36%)