TermitTranslations with speech synthesis in your terminal as a ruby gem
Stars: ✭ 505 (+260.71%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+250%)
AutovcAutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Stars: ✭ 485 (+246.43%)
GanttsPyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Stars: ✭ 460 (+228.57%)
SprocketVoice Conversion Tool Kit
Stars: ✭ 425 (+203.57%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+3137.86%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+152.86%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+112.14%)
voice-conversionan tutorial implement of voice conversion using pytorch
Stars: ✭ 26 (-81.43%)
EmotionalConversionStarGANThis repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Stars: ✭ 92 (-34.29%)
Meta-TTSOfficial repository of https://arxiv.org/abs/2111.04040v1
Stars: ✭ 69 (-50.71%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (-83.57%)
MelNet-SpeechGenerationImplementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-86.43%)
mimic2Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (+283.57%)
Voice2MeshCVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (-52.14%)
ExtensibleTTS-PyTorchAn extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
Stars: ✭ 25 (-82.14%)
ppg-vcPPG-Based Voice Conversion
Stars: ✭ 154 (+10%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-79.29%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+46.43%)
sova-tts-tpsNLP-preprocessor for the SOVA-TTS project
Stars: ✭ 44 (-68.57%)
QPPWGQuasi-Periodic Parallel WaveGAN Pytorch implementation
Stars: ✭ 41 (-70.71%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (-25.71%)
few-shot-transformer-ttsByte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Stars: ✭ 60 (-57.14%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-87.14%)
MediumVCAny-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Stars: ✭ 46 (-67.14%)
PhomemeSimple sentence mixing tool (work in progress)
Stars: ✭ 18 (-87.14%)
cookiettsTTS from Cookie. Messy and experimental!
Stars: ✭ 29 (-79.29%)