dropclass speaker: DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-41.18%)
open-speech-corpora: 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2373.53%)
wavenet-classifier: Keras implementation of DeepMind's WaveNet for supervised learning tasks
Stars: ✭ 54 (+58.82%)
bob: Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute in Switzerland. Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (+11.76%)
spokestack-ios: Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-20.59%)
Speaker-Recognition: This repo contains my attempt to create a speaker recognition and verification system using SideKit-1.3.1
Stars: ✭ 94 (+176.47%)
D-TDNN: PyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (+76.47%)
GE2E-Loss: PyTorch implementation of Generalized End-to-End Loss for speaker verification
Stars: ✭ 72 (+111.76%)
Voice-ML: MobileNet trained on the VoxCeleb dataset and used for voice verification
Stars: ✭ 15 (-55.88%)
UHV-OTS-Speech: A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+176.47%)
cobra: On-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+123.53%)
meta-SR: PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech 2020)
Stars: ✭ 58 (+70.59%)
Speaker-Identification: A program for automatic speaker identification using deep learning techniques.
Stars: ✭ 84 (+147.06%)
UniSpeech: UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+558.82%)
spokestack-android: Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+52.94%)
speakerIdentificationNeuralNetworks: The speaker recognition system consists of two phases, feature extraction and recognition. In the extraction phase, the speaker's voice is recorded and a number of features are extracted to form a model. During the recognition phase, a speech sample is compared against a previously created voice print stored in the database. The hi…
Stars: ✭ 26 (-23.53%)
voice gender detection: ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Stars: ✭ 51 (+50%)
DiViMe: ACLEW Diarization Virtual Machine
Stars: ✭ 28 (-17.65%)
leopard: On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+941.18%)
Shifter: Pitch shifter using WSOLA and resampling, implemented in Python 3
Stars: ✭ 22 (-35.29%)
2018-dlsl: UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-47.06%)
awesome-keyword-spotting: A curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection) resources.
Stars: ✭ 150 (+341.18%)
IMS-Toucan: Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. The development goals are simplicity, modularity, controllability, and multilinguality.
Stars: ✭ 295 (+767.65%)
QuantumSpeech-QCNN: IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+108.82%)
Datadriven-GPVAD: The codebase for data-driven general-purpose voice activity detection.
Stars: ✭ 81 (+138.24%)
SmartMirror: My MagicMirror running on a Raspberry Pi
Stars: ✭ 110 (+223.53%)
KeenASR-Android-PoC: A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-38.24%)
AutoSpeech: [Interspeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Stars: ✭ 195 (+473.53%)
Caster: Dragonfly-Based Voice Programming and Accessibility Toolkit
Stars: ✭ 242 (+611.76%)
octopus: On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-11.76%)
Voicebook: 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Stars: ✭ 236 (+594.12%)
speechrec: A simple speech recognition app using the Web Speech API interfaces
Stars: ✭ 18 (-47.06%)
picovoice: The end-to-end platform for building voice products at scale
Stars: ✭ 316 (+829.41%)
brasiltts: Brasil TTS is a set of Brazilian Portuguese speech synthesizers that read screens aloud for people with visual impairments. It converts text to audio, giving blind and low-vision users access to the content shown on screen. Although the main target audience of text-to-speech systems such as Brasil TTS consists of…
Stars: ✭ 34 (+0%)
Project news alan ai: In this video, we build a conversational, voice-controlled React news application using Alan AI. Alan AI is speech recognition software that lets you add voice capabilities to your applications.
Stars: ✭ 202 (+494.12%)
Voice Overlay Android: 🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+455.88%)
QuietVR: A Quiet Place in VR: Generate any 3D object with your voice. It's magic!
Stars: ✭ 17 (-50%)
awesome-speech-enhancement: A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (+41.18%)
spafe: 🔉 spafe: Simplified Python Audio Features Extraction
Stars: ✭ 310 (+811.76%)
Vosk: VOSK Speech Recognition Toolkit
Stars: ✭ 182 (+435.29%)
Piwho: Speaker recognition library based on MARF for Raspberry Pi and other SBCs.
Stars: ✭ 50 (+47.06%)
Swiftspeech: A speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+338.24%)
Mmm Awesome Alexa: Turn your MagicMirror into an 'Amazon Echo'. Activated when you say 'Alexa'.
Stars: ✭ 122 (+258.82%)
Spokestack Python: Spokestack is a library that lets you easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+202.94%)
Vosk Api: Offline speech recognition API for Android, iOS, Raspberry Pi, and servers with Python, Java, C#, and Node
Stars: ✭ 1,357 (+3891.18%)