GE2E-LossPytorch implementation of Generalized End-to-End Loss for speaker verification
Stars: ✭ 72 (-23.4%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-81.91%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+118.09%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+138.3%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+157.45%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+712.77%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+55.32%)
2018-dlslUPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-80.85%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+59.57%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+794.68%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-50%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-57.45%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+615.96%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-71.28%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (+376.6%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-80.85%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-70.21%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (-32.98%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-24.47%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-54.26%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+25.53%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+75.53%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+2130.85%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+3820.21%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+71.28%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+70.21%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+102.13%)
Rnnt Speech RecognitionEnd-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Stars: ✭ 158 (+68.09%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+65.96%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+164.89%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+134.04%)
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+102.13%)
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (+60.64%)
Speech To Text RussianПроект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+60.64%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+101.06%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+58.51%)
DragonflySpeech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Stars: ✭ 209 (+122.34%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+2153.19%)
Speech Recognition Neural NetworkThis is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (+57.45%)
SpeechrecognizerbuttonUIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (+53.19%)
DlaDeep learning for audio processing
Stars: ✭ 142 (+51.06%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+163.83%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+118.09%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+93.62%)
Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (+45.74%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (+43.62%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+108.51%)