PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (-9.74%)

Mutual labels: speech-synthesis

QPPWG

Quasi-Periodic Parallel WaveGAN PyTorch implementation

Stars: ✭ 41 (-73.38%)

Mutual labels: speech-synthesis

audioslides.io

Use Amazon Polly, Google Slides, and FFmpeg to create videos that can be updated at any time by anyone. This project is written in Elixir.

Stars: ✭ 19 (-87.66%)

Mutual labels: speech-synthesis

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-57.79%)

Mutual labels: speech-synthesis

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+446.1%)

Mutual labels: speech-synthesis

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-67.53%)

Mutual labels: speech-synthesis

wiki2ssml

Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.

Stars: ✭ 31 (-79.87%)

Mutual labels: speech-synthesis

Khronos

The open source intelligent personal assistant

Stars: ✭ 25 (-83.77%)

Mutual labels: speech-synthesis

sova-tts-tps

NLP-preprocessor for the SOVA-TTS project

Stars: ✭ 44 (-71.43%)

Mutual labels: speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+941.56%)

Mutual labels: speech-synthesis

melgan

MelGAN implementation with Multi-Band and Full-Band support...

Stars: ✭ 54 (-64.94%)

Mutual labels: speech-synthesis

MixPath

MixPath: A Unified Approach for One-shot Neural Architecture Search

Stars: ✭ 29 (-81.17%)

Mutual labels: one-shot

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (-57.14%)

Mutual labels: speech-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-65.58%)

Mutual labels: speech-synthesis

TinyCog

Small Robot, Toy Robot platform

Stars: ✭ 29 (-81.17%)

Mutual labels: speech-synthesis

GlottDNN

GlottDNN vocoder and tools for training DNN excitation models

Stars: ✭ 30 (-80.52%)

Mutual labels: speech-synthesis

speechrec

A simple speech recognition app using the Web Speech API interfaces

Stars: ✭ 18 (-88.31%)

Mutual labels: speech-synthesis

sam

Software Automatic Mouth - Tiny Speech Synthesizer

Stars: ✭ 316 (+105.19%)

Mutual labels: speech-synthesis

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-45.45%)

Mutual labels: speech-synthesis

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+1448.05%)

Mutual labels: conformer

systole

Systole: A python package for cardiac signal synchrony and analysis

Stars: ✭ 51 (-66.88%)

Mutual labels: ppg

ttsflow

TensorFlow speech synthesis C++ inference for voicenet

Stars: ✭ 17 (-88.96%)

Mutual labels: speech-synthesis

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Stars: ✭ 45 (-70.78%)

Mutual labels: voice-conversion

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-77.27%)

Mutual labels: speech-synthesis

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-73.38%)

Mutual labels: speech-synthesis

CGCF-ConfGen

🧪 Learning Neural Generative Dynamics for Molecular Conformation Generation (ICLR 2021)

Stars: ✭ 41 (-73.38%)

Mutual labels: conformer

AFE4490 Oximeter

This pulse oximetry shield from ProtoCentral uses the AFE4490 IC to enable your Arduino to measure heart rate as well as SpO2 values.

Stars: ✭ 39 (-74.68%)

Mutual labels: ppg

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (+196.1%)

Mutual labels: conformer

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-87.01%)

Mutual labels: voice-conversion

Shifter

Pitch shifter using WSOLA and resampling, implemented in Python 3

Stars: ✭ 22 (-85.71%)

Mutual labels: voice-conversion

Phomeme

Simple sentence mixing tool (work in progress)

Stars: ✭ 18 (-88.31%)

Mutual labels: voice-conversion

Catch-A-Waveform

Official PyTorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Stars: ✭ 117 (-24.03%)

Mutual labels: speech-synthesis

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (+33.12%)

Mutual labels: speech-synthesis

Upcharika

A Flutter application that helps people measure their vitals using photoplethysmography and computer vision

Stars: ✭ 37 (-75.97%)

Mutual labels: ppg

ml-with-audio

HF's ML for Audio study group

Stars: ✭ 104 (-32.47%)

Mutual labels: speech-synthesis

NanoFlow

PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)

Stars: ✭ 63 (-59.09%)

Mutual labels: speech-synthesis

PPG

Code to estimate HR from PPG signals using Subspace Decomposition and Kalman filter for the dataset of 22 PPG recordings provided for the 2015 IEEE Signal Processing Cup (SP Cup) competition. The traces are stored in folder 'DATABASE'. Please cite this publication when referencing this material: "Measuring Heart Rate During Physical Exercise by …

Stars: ✭ 43 (-72.08%)

Mutual labels: ppg

sova-tts-engine

Tacotron2 based engine for the SOVA-TTS project

Stars: ✭ 63 (-59.09%)

Mutual labels: speech-synthesis

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Stars: ✭ 60 (-61.04%)

Mutual labels: speech-synthesis

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+91.56%)

Mutual labels: speech-synthesis

deep-learning-german-tts

Thorsten-Voice: A free-to-use, offline, high-quality German TTS voice should be available for every project without any licensing hassle.

Stars: ✭ 268 (+74.03%)

Mutual labels: speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+59.09%)

Mutual labels: speech-synthesis

voder

An emulation of the Voder Speech Synthesizer.

Stars: ✭ 19 (-87.66%)

Mutual labels: speech-synthesis

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-82.47%)

Mutual labels: speech-synthesis

tacotron2

PyTorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

Stars: ✭ 17 (-88.96%)

Mutual labels: speech-synthesis

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (-30.52%)

Mutual labels: speech-synthesis

Voice-Conversion

No description or website provided.

Stars: ✭ 30 (-80.52%)

Mutual labels: voice-conversion

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+57.14%)

Mutual labels: speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration