Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (+50%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (+21.43%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+14878.57%)
titanium-speechUse the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop.
Stars: ✭ 21 (+50%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (+28.57%)
scibloxsciblox - Easier Data Science and Machine Learning
Stars: ✭ 48 (+242.86%)
praiseDo stuff with your voice in the browser.
Stars: ✭ 13 (-7.14%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+642.86%)
getcontactFind info about user by phone number using GetContact API
Stars: ✭ 228 (+1528.57%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+328.57%)
machine-learning-data-pipelinePipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (+57.14%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (+50%)
KodiSharpUse Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
Stars: ✭ 22 (+57.14%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (+450%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+171.43%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (+21.43%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+2157.14%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (+250%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (+50%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (+392.86%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (+78.57%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+3157.14%)
2018-dlslUPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (+28.57%)
modelscriptREPO MOVED TO https://github.com/repetere/jsonstack-data - Data Science and Machine learning in JavaScript
Stars: ✭ 40 (+185.71%)
pyAudioProcessingAudio feature extraction and classification
Stars: ✭ 165 (+1078.57%)
salutejsSmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (+150%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+407.14%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+535.71%)
google-voiceRuby interaction with Google Voice
Stars: ✭ 16 (+14.29%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+971.43%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+364.29%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (+50%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (+114.29%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+642.86%)
iOSProjectsIt's project that contains different applications developed with Swift 5.7 👨💻👩🏼💻🧑🏿💻
Stars: ✭ 122 (+771.43%)
spafe🔉 spafe: Simplified Python Audio Features Extraction
Stars: ✭ 310 (+2114.29%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (+121.43%)
PhoneCountryCodePickerAn iOS tableview picker for PhoneCountryCode (English & Chinese supported)
Stars: ✭ 31 (+121.43%)
hf-experimentsExperiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (+164.29%)
nuts-mlFlow-based data pre-processing for deep learning
Stars: ✭ 32 (+128.57%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (+0%)
cepCEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Stars: ✭ 140 (+900%)
bananapi-zero-ubuntu-base-minimalBananaPi M2 Zero - Ubuntu Focal Base Minimal Image (Experimental) - U-Boot 2017.09 / Kernel 4.18.y / Kernel 4.19.y / Kernel 4.20.y / Kernel 5.3.y / Kernel 5.6.y / Kernel 5.7.y / Kernel 5.11.y
Stars: ✭ 77 (+450%)
specAugmentTensor2tensor experiment with SpecAugment
Stars: ✭ 46 (+228.57%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+257.14%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (+150%)
masr中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Stars: ✭ 179 (+1178.57%)
ConvolutionaNeuralNetworksToEnhanceCodedSpeechIn this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
Stars: ✭ 25 (+78.57%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+700%)