cepCEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Stars: ✭ 140 (-12.5%)
specAugmentTensor2tensor experiment with SpecAugment
Stars: ✭ 46 (-71.25%)
masr中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Stars: ✭ 179 (+11.88%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (-60.62%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+11.88%)
multilingual kwsFew-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Stars: ✭ 122 (-23.75%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (-83.12%)
CaptionThis"Caption This" is an iOS app that adds real-time captions to videos for Instagram Stories
Stars: ✭ 12 (-92.5%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+28.13%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+121.25%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (-83.75%)
obviA Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-66.25%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (-66.87%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (-63.75%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-86.87%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-41.25%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-47.5%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+58.13%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+55.63%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+51.25%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+2203.13%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+37.5%)
DragonflySpeech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Stars: ✭ 209 (+30.63%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+28.13%)
K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+22.5%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+21.88%)
LingvoLingvo
Stars: ✭ 2,361 (+1375.63%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+18.75%)
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+18.75%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+18.13%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+1223.75%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+13.75%)
CidlibThe CIDLib general purpose C++ development environment
Stars: ✭ 179 (+11.88%)
Deepspeech ServerA testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+10%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+6.88%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+3.13%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+0.63%)
SetkTools for Speech Enhancement integrated with Kaldi
Stars: ✭ 227 (+41.88%)
Ctc pytorchCTC end -to-end ASR for timit and 863 corpus.
Stars: ✭ 161 (+0.63%)