This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.

Stars: ✭ 3,307 (-27.05%)

Mutual labels: machine-translation

asr24

24-hour Automatic Speech Recognition

Stars: ✭ 27 (-99.4%)

Mutual labels: kaldi

vcc20 baseline cyclevae

Voice Conversion Challenge 2020 CycleVAE baseline system

Stars: ✭ 123 (-97.29%)

Mutual labels: voice-conversion

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (-95.48%)

Mutual labels: speech-recognition

htk

HTK Toolkit with Linux 64 bit and Docker support

Stars: ✭ 14 (-99.69%)

Mutual labels: speech-recognition

wiki2ssml

Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.

Stars: ✭ 31 (-99.32%)

Mutual labels: speech-synthesis

deep-learning-platforms

deep-learning platforms,framework,data（深度学习平台、框架、资料）

Stars: ✭ 17 (-99.62%)

Mutual labels: chainer

good-speech-web-client

Practice your speech level in any language using speech recognition

Stars: ✭ 26 (-99.43%)

Mutual labels: speech-recognition

Pytorchwavenetvocoder

WaveNet-Vocoder implementation with pytorch.

Stars: ✭ 269 (-94.07%)

Mutual labels: speech-synthesis

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (-98.81%)

Mutual labels: speech-recognition

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-99.56%)

Mutual labels: voice-conversion

sova-tts-engine

Tacotron2 based engine for the SOVA-TTS project

Stars: ✭ 63 (-98.61%)

Mutual labels: speech-synthesis

sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.

Stars: ✭ 45 (-99.01%)

Mutual labels: speech-recognition

quickstart-examples

Integration examples of Tanker's client-side encryption SDKs

Stars: ✭ 17 (-99.62%)

Mutual labels: end-to-end

Calculate-SNR-SDR

Script to calculate SNR and SDR using python

Stars: ✭ 76 (-98.32%)

Mutual labels: speech-separation

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-92.83%)

Mutual labels: speech-synthesis

GlottDNN

GlottDNN vocoder and tools for training DNN excitation models

Stars: ✭ 30 (-99.34%)

Mutual labels: speech-synthesis

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-99.54%)

Mutual labels: speech-recognition

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (-93.49%)

Mutual labels: speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-98.39%)

Mutual labels: speech-synthesis

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-99.54%)

Mutual labels: speech-recognition

char-rnn-text-generation

Character Embeddings Recurrent Neural Network Text Generation Models