ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+978.26%)
SnapMixSnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021)
Stars: ✭ 127 (+452.17%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (+1673.91%)
DataAugmentationTFImplementation of modern data augmentation techniques in TensorFlow 2.x to be used in your training pipeline.
Stars: ✭ 35 (+52.17%)
specAugmentTensor2tensor experiment with SpecAugment
Stars: ✭ 46 (+100%)
manifold mixupTensorflow implementation of the Manifold Mixup machine learning research paper
Stars: ✭ 24 (+4.35%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (+26.09%)
candockA time series signal analysis and classification framework
Stars: ✭ 56 (+143.48%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (+13.04%)
augraphyAugmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Stars: ✭ 49 (+113.04%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (+60.87%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (+4.35%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+256.52%)
OrientedRepPoints DOTAOriented Object Detection: Oriented RepPoints + Swin Transformer/ReResNet
Stars: ✭ 62 (+169.57%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+230.43%)
fastai sparse3D augmentation and transforms of 2D/3D sparse data, such as 3D triangle meshes or point clouds in Euclidean space. Extension of the Fast.ai library to train Sub-manifold Sparse Convolution Networks
Stars: ✭ 46 (+100%)
KoEDAKorean Easy Data Augmentation
Stars: ✭ 62 (+169.57%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+791.3%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+10265.22%)
pytorch audioaudio processing module for pytorch:stft, istft
Stars: ✭ 33 (+43.48%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (+56.52%)
semantic-parsing-dualSource code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".
Stars: ✭ 17 (-26.09%)
scriptySpeech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-39.13%)
bird species classificationSupervised Classification of bird species 🐦 in high resolution images, especially for, Himalayan birds, having diverse species with fairly low amount of labelled data
Stars: ✭ 59 (+156.52%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (+352.17%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (+504.35%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (+17.39%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-8.7%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (+208.7%)
traj-pred-irlOfficial implementation codes of "Regularizing neural networks for future trajectory prediction via IRL framework"
Stars: ✭ 23 (+0%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+873.91%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (+300%)
Regularization-Pruning[ICLR'21] PyTorch code for our paper "Neural Pruning via Growing Regularization"
Stars: ✭ 44 (+91.3%)
mongolian-nlpUseful resources for Mongolian NLP
Stars: ✭ 119 (+417.39%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+300%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+3556.52%)
webdatasetA high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Stars: ✭ 816 (+3447.83%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (+52.17%)
favorite-research-papersListing my favorite research papers 📝 from different fields as I read them.
Stars: ✭ 12 (-47.83%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (+391.3%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (+56.52%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (+56.52%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (+165.22%)
timit-preprocessorExtract mfcc vectors and phones from TIMIT dataset
Stars: ✭ 14 (-39.13%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+126.09%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (+17.39%)
consistencyImplementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
Stars: ✭ 26 (+13.04%)