room-impulse-responsesA list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (-73.12%)
mapbox-assistant-exampleExamples of Amazon Echo, Google Home, and other bots interacting with Mapbox services.
Stars: ✭ 15 (-97.18%)
brasilttsBrasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-93.61%)
Noise2Noise-audio denoising without clean training dataSource code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-90.79%)
web-speech-demoLearn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-96.43%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (-78.76%)
Ios 10 SamplerCode examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+528.01%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-95.86%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-83.46%)
xiaoai-patchPatching for XiaoAi Speakers, add custom binaries and open source software. Tested on LX06, LX01, LX05, L09A
Stars: ✭ 58 (-89.1%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (-90.6%)
AlexaNew version is at https://github.com/respeaker/avs
Stars: ✭ 49 (-90.79%)
oneshot-audioExperiment with "one-shot learning" techniques to recognize a voice signature
Stars: ✭ 22 (-95.86%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-93.23%)
minutes🔭 Speaker diarization via transfer learning
Stars: ✭ 25 (-95.3%)
nlp-classA Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (-82.14%)
capeContinuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (-94.55%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-88.72%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-94.92%)
MMM-NFLNational Football League Module for MagicMirror²
Stars: ✭ 22 (-95.86%)
scriptionAn editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
Stars: ✭ 46 (-91.35%)
ventib📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (-91.92%)
ad-alexatalkingclockAlexa (or other Smart Speakers) tell you the time without asking every hour. Please ⭐️if you like my app :)
Stars: ✭ 30 (-94.36%)
Home Assistantconfig🏠 Home Assistant configuration & Documentation for my Smart House. Write-ups, videos, part lists, and links throughout. Be sure to ⭐ it. Updated FREQUENTLY!
Stars: ✭ 3,687 (+593.05%)
alexa-skill-clean-code-templateAlexa Skill Template with clean code (eslint, sonar), testing (unit tests, e2e), multi-language, Alexa Presentation Language (APL) and In-Skill Purchases (ISP) support. Updated to ASK-CLI V2.
Stars: ✭ 34 (-93.61%)
gtranscribeSoftware for interview transcription
Stars: ✭ 12 (-97.74%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (-97.18%)
Speech Feature ExtractionFeature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (-85.34%)
linear16Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (-97.37%)
RhasspyOffline private voice assistant for many human languages
Stars: ✭ 458 (-13.91%)
Alexa-skills-starters💻 A collection of super cool Amazon Alexa skills for complete newbies. 💻
Stars: ✭ 24 (-95.49%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-92.48%)
Xr3player🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (-11.28%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-82.71%)
AlexaAndroidNo description or website provided.
Stars: ✭ 15 (-97.18%)
Multimodal-Gesture-Recognition-with-LSTMs-and-CTCAn end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-95.3%)
alexa-swift3-sample-appA sample iOS/Swift3 app that brings Alexa Voice Service to your phone.
Stars: ✭ 48 (-90.98%)
Tts🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+920.11%)
DeepSegmentorSequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
Stars: ✭ 17 (-96.8%)
tt-vae-ganTimbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Stars: ✭ 37 (-93.05%)
converseConversational text Analysis using various NLP techniques
Stars: ✭ 147 (-72.37%)
HoobsBuild your Smart Home with HOOBS. Connect over 2,000 Accessories to your favorite Ecosystem.
Stars: ✭ 325 (-38.91%)
BangalASRTransformer based Bangla Speech Recognition
Stars: ✭ 20 (-96.24%)
voice-landing-pageFree Landing Page Bootstrap Template for Alexa Skills and Google Actions
Stars: ✭ 21 (-96.05%)
VAD-LTSDEfficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-93.05%)
Speech256An FPGA implementation of a classic 80ies speech synthesizer. Done for the Retro Challenge 2017/10.
Stars: ✭ 51 (-90.41%)