VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+331.94%)
DeepspeechrecognitionA Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
Stars: ✭ 1,421 (+886.81%)
PocketsphinxPocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+1937.5%)
mongolian-nlpUseful resources for Mongolian NLP
Stars: ✭ 119 (-17.36%)
HapticaEasy Haptic Feedback Generator 📳
Stars: ✭ 587 (+307.64%)
Casr Demo基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Stars: ✭ 76 (-47.22%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-68.06%)
Flawless IosAwesome iOS guides from the community, shared on Flawless iOS Medium blog 👉
Stars: ✭ 260 (+80.56%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-83.33%)
Wavesurfer.jsNavigable waveform built on Web Audio and Canvas
Stars: ✭ 5,905 (+4000.69%)
vspeech📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜
Stars: ✭ 38 (-73.61%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+276.39%)
favorite-research-papersListing my favorite research papers 📝 from different fields as I read them.
Stars: ✭ 12 (-91.67%)
UnityandroidspeechrecognitionThis repository is a Unity plugin for Android Speech Recognition (based on Java implementation)
Stars: ✭ 73 (-49.31%)
HotVoiceAdds Speech Recognition support to AutoHotkey, via a C# DLL
Stars: ✭ 41 (-71.53%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-31.25%)
Cortex M KwsCortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-68.75%)
Listen-Attend-Spell-v2PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Stars: ✭ 29 (-79.86%)
E2e AsrPyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-26.39%)
homebridge-deebotHomebridge plugin to integrate ECOVACS Deebot devices into HomeKit.
Stars: ✭ 39 (-72.92%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-38.89%)
klangsyntheseWaveform and Audio Synthesis library in Go
Stars: ✭ 57 (-60.42%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-70.14%)
SounderAn intent recognizing algorithm to predict the intent of a given text.
Stars: ✭ 118 (-18.06%)
Factorized TdnnPyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-31.94%)
Avsr Deep SpeechGoogle Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-70.14%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-90.28%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-50.69%)
Wfplayer🌊 WFPlayer.js is an audio waveform generator
Stars: ✭ 124 (-13.89%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-50.69%)
SwiftybuttonSimple and customizable button in Swift
Stars: ✭ 471 (+227.08%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (-89.58%)
MusicottJavaFX application that manages and plays music files.
Stars: ✭ 97 (-32.64%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+606.25%)
araAra is a golang server for real-time public transport data exchange, using the SIRI protocol.
Stars: ✭ 12 (-91.67%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-84.72%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (-46.53%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (+211.11%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-72.22%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-58.33%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (-65.97%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-18.06%)
FdwaveformviewReads an audio file and displays the waveform
Stars: ✭ 997 (+592.36%)
BangalASRTransformer based Bangla Speech Recognition
Stars: ✭ 20 (-86.11%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (-54.17%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (-22.22%)
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+589.58%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (-78.47%)
Addon Homebridge Homebridge - Community Hass.io Add-on for Home Assistant
Stars: ✭ 96 (-33.33%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-63.19%)
HhcustomcornerAwesome library to customize corners of UIView and UIButton. Now you can customize each corner differently
Stars: ✭ 36 (-75%)
denver.luaa simple library to help you play custom waveforms with LÖVE
Stars: ✭ 66 (-54.17%)
EmspinnerbuttonUIButton sublcass with loading animation
Stars: ✭ 117 (-18.75%)
Ai Study人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP、推荐系统、语音识别、图神经网路、算法工程师面试题
Stars: ✭ 93 (-35.42%)
MclightingThe ESP8266 based multi-client lighting gadget
Stars: ✭ 977 (+578.47%)