soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (+527.78%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+13144.44%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+583.33%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (+477.78%)
obviA Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (+200%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (+22.22%)
hf-experimentsExperiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (+105.56%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+1866.67%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+2077.78%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+7555.56%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+422.22%)
FAST-RIRThis is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (+400%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+522.22%)
DCGCNDensely Connected Graph Convolutional Networks for Graph-to-Sequence Learning (authors' MXNet implementation for the TACL19 paper)
Stars: ✭ 73 (+305.56%)
MDAPLThe de facto standard for people who are looking to learn Dyalog APL from a book. This updated version is a work in progress.
Stars: ✭ 24 (+33.33%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (+172.22%)
bergamot-translatorCross platform C++ library focusing on optimized machine translation on the consumer-grade device.
Stars: ✭ 181 (+905.56%)
titanium-speechUse the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop.
Stars: ✭ 21 (+16.67%)
masr中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Stars: ✭ 179 (+894.44%)
Speaker-RecognitionThis repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Stars: ✭ 94 (+422.22%)
DLAIEMaterials for Hawley's Deep Learning & AI Ethics course
Stars: ✭ 27 (+50%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (+38.89%)
KodiSharpUse Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
Stars: ✭ 22 (+22.22%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+894.44%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (+16.67%)
TS3000 TheChatBOTIts a social networking chat-bot trained on Reddit dataset . It supports open bounded queries developed on the concept of Neural Machine Translation. Beware of its being sarcastic just like its creator 😝 BDW it uses Pytorch framework and Python3.
Stars: ✭ 20 (+11.11%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (+283.33%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+2433.33%)
cepCEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Stars: ✭ 140 (+677.78%)
reinforcement learning course materialsLecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University
Stars: ✭ 765 (+4150%)
specAugmentTensor2tensor experiment with SpecAugment
Stars: ✭ 46 (+155.56%)
bytenet translationA TensorFlow Implementation of Machine Translation In Neural Machine Translation in Linear Time
Stars: ✭ 60 (+233.33%)
GE2E-LossPytorch implementation of Generalized End-to-End Loss for speaker verification
Stars: ✭ 72 (+300%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+394.44%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (+250%)
salutejsSmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (+94.44%)
multilingual kwsFew-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Stars: ✭ 122 (+577.78%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+733.33%)
CaptionThis"Caption This" is an iOS app that adds real-time captions to videos for Instagram Stories
Stars: ✭ 12 (-33.33%)
Voice-MLMobileNet trained with VoxCeleb dataset and used for voice verification
Stars: ✭ 15 (-16.67%)
Neural-Machine-TranslationSeveral basic neural machine translation models implemented by PyTorch & TensorFlow
Stars: ✭ 29 (+61.11%)
praiseDo stuff with your voice in the browser.
Stars: ✭ 13 (-27.78%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+1038.89%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (+16.67%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (+44.44%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (+0%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (+94.44%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+111.11%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (+194.44%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (+66.67%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (+222.22%)