Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+340.3%)

Mutual labels: speech, speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+265.67%)

Mutual labels: speech, speech-synthesis

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Stars: ✭ 54 (-19.4%)

Mutual labels: speech, speech-synthesis

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+261.19%)

Mutual labels: speech, speech-synthesis

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+25.37%)

Mutual labels: speech, speech-synthesis

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+135.82%)

Mutual labels: speech, speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+631.34%)

Mutual labels: speech, speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+440.3%)

Mutual labels: speech, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-22.39%)

Mutual labels: speech, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (+8.96%)

Mutual labels: speech, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-46.27%)

Mutual labels: speech, speech-synthesis

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (+105.97%)

Mutual labels: speech, speech-synthesis

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-2.99%)

Mutual labels: speech, speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-50.75%)

Mutual labels: speech, speech-synthesis

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-80.6%)

Mutual labels: speech

seeing-red

Using PPG Obtained via Smartphone Cameras for Authentication

Stars: ✭ 29 (-56.72%)

Mutual labels: biometrics

gtranscribe

Software for interview transcription

Stars: ✭ 12 (-82.09%)

Mutual labels: speech

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-71.64%)

Mutual labels: speech

Finished Senior LSC Python

Python implementation of LSC algorithm, (C) Zhengqin Li, Jiansheng Chen, 2014

Stars: ✭ 19 (-71.64%)

Mutual labels: cvpr

Speech Feature Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

Stars: ✭ 78 (+16.42%)

Mutual labels: speech

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-17.91%)

Mutual labels: speech-synthesis

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-38.81%)

Mutual labels: speech-synthesis

3-D-Scene-Graph

3D scene graph generator implemented in Pytorch.

Stars: ✭ 52 (-22.39%)

Mutual labels: 3d-models

awesome-citygml

The ultimate list of open data semantic city models

Stars: ✭ 57 (-14.93%)

Mutual labels: 3d-models

klatt-syn

Klatt formant synthesizer

Stars: ✭ 18 (-73.13%)

Mutual labels: speech-synthesis

learning2hash.github.io

Website for "A survey of learning to hash for Computer Vision" https://learning2hash.github.io

Stars: ✭ 14 (-79.1%)

Mutual labels: cvpr

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (-79.1%)

Mutual labels: speech

ACGPN

"Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content"，CVPR 2020. (Modified from original with fixes for inference)

Stars: ✭ 48 (-28.36%)

Mutual labels: cvpr

StlVault

3D object viewer and organizer

Stars: ✭ 104 (+55.22%)

Mutual labels: 3d-models

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (-40.3%)

Mutual labels: speech

StickMan-3D

StickMan 3D: First Round | indie fighting game | C++ OpenGL

Stars: ✭ 60 (-10.45%)

Mutual labels: 3d-models

building-editor

3D model editor for building/architecture

Stars: ✭ 24 (-64.18%)

Mutual labels: 3d-models

ppg-vc

PPG-Based Voice Conversion

Stars: ✭ 154 (+129.85%)

Mutual labels: speech-synthesis

DeepSegmentor

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Stars: ✭ 17 (-74.63%)

Mutual labels: speech

Modaily-Aware-Audio-Visual-Video-Parsing

Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing

Stars: ✭ 19 (-71.64%)

Mutual labels: cvpr

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (+205.97%)

Mutual labels: speech-synthesis

HybrIK

Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021

Stars: ✭ 395 (+489.55%)

Mutual labels: cvpr

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+1155.22%)

Mutual labels: speech-synthesis

deep-learning-german-tts

Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.

Stars: ✭ 268 (+300%)

Mutual labels: speech-synthesis

ExtensibleTTS-PyTorch

An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery

Stars: ✭ 25 (-62.69%)

Mutual labels: speech-synthesis

Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis System: Sinsy-NG

Stars: ✭ 15 (-77.61%)

Mutual labels: speech-synthesis

Universal Head 3DMM

This is a Project Page of 'Towards a complete 3D morphable model of the human head'

Stars: ✭ 138 (+105.97%)

Mutual labels: 3dmm

redcube

JS renderer based on GLTF to WebGPU or WebGL backends.

Stars: ✭ 86 (+28.36%)

Mutual labels: 3d-models

data-at-hand-mobile

Mobile application for exploring fitness data using both speech and touch interaction.

Stars: ✭ 50 (-25.37%)

Mutual labels: speech

active-inference

A toy model of Friston's active inference in Tensorflow

Stars: ✭ 36 (-46.27%)

Mutual labels: cognitive-science

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (-44.78%)

Mutual labels: speech

PolyDraw

✳️ PTSource PolyDraw is a free 3D polygonal modeller for Windows x86 and x64, for creating or modifying 3D objects using a mesh of 3D points and parametric NURBS Curves .Exports and imports to over 40 formats including WebVR and 3D Printing.

Stars: ✭ 17 (-74.63%)

Mutual labels: 3d-models

treegen

Vegetation Generation Tool for Houdini

Stars: ✭ 72 (+7.46%)

Mutual labels: 3d-models

Syn2Real

Repository for Transfer Learning using Deep CNNs trained with synthetic images

Stars: ✭ 16 (-76.12%)

Mutual labels: 3d-models

1-60 of 546 similar projects

›

next*5