AESRC2020a deep accent recognition network
Stars: ✭ 35 (-75%)
GE2E-LossPytorch implementation of Generalized End-to-End Loss for speaker verification
Stars: ✭ 72 (-48.57%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+152.86%)
CasterDragonfly-Based Voice Programming and Accessibility Toolkit
Stars: ✭ 242 (+72.86%)
wavenet-classifierKeras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (-61.43%)
SmartMirrorMy MagicMirror running on a Raspberry Pi
Stars: ✭ 110 (-21.43%)
insightfaceimplementation of insightface by using Tensorflow
Stars: ✭ 97 (-30.71%)
meta-embeddingsMeta-embeddings are a probabilistic generalization of embeddings in machine learning.
Stars: ✭ 22 (-84.29%)
Deepstream ProjectThis is a highly separated deployment project based on Deepstream , including the full range of Yolo and continuously expanding deployment projects such as Ocr.
Stars: ✭ 120 (-14.29%)
D-TDNNPyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (-57.14%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+35%)
Mmm Awesome AlexaTurn your MagicMirror into an 'Amazon Echo'. Activated when you say 'Alexa'.
Stars: ✭ 122 (-12.86%)
voice gender detection♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Stars: ✭ 51 (-63.57%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-85%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+869.29%)
PiwhoSpeaker recognition library based on MARF for raspberry pi and other SBCs.
Stars: ✭ 50 (-64.29%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-64.29%)
oneshot-audioExperiment with "one-shot learning" techniques to recognize a voice signature
Stars: ✭ 22 (-84.29%)
VoicerAGI-server voice recognizer for #Asterisk
Stars: ✭ 73 (-47.86%)
Speaker-RecognitionThis repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Stars: ✭ 94 (-32.86%)
bobBob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-72.86%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-80.71%)
AutoSpeech[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Stars: ✭ 195 (+39.29%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+125.71%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-49.29%)
Voicebook🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Stars: ✭ 236 (+68.57%)
PLSCPaddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, DeiT, FaceViT.
Stars: ✭ 113 (-19.29%)
Project news alan aiIn this video, we're going to build a Conversational Voice Controlled React News Application using Alan AI. Alan AI is a revolutionary speech recognition software that allows you to add voice capabilities to your applications.
Stars: ✭ 202 (+44.29%)
myprosodyA Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Stars: ✭ 162 (+15.71%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+30%)
QuietVRA Quiet Place in VR: Generate any 3D object with your voice. It's magic!
Stars: ✭ 17 (-87.86%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+6.43%)
FaceRecognitionCppLarge input size REAL-TIME Face Detector on Cpp. It can also support face verification using MobileFaceNet+Arcface with real-time inference. 480P Over 30FPS on CPU
Stars: ✭ 40 (-71.43%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-26.43%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-45.71%)
Stayfit📱 🏃 🍎 Fitness application that’s used to keep track of your physical fitness data, daily calorie count, invite friends to work out together and ultimately get healthy.
Stars: ✭ 90 (-35.71%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-78.57%)
Node JuliusNode.js module for voice recognition using Julius
Stars: ✭ 69 (-50.71%)
brasilttsBrasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-75.71%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+500.71%)