All Projects → Espnet → Similar Projects or Alternatives

717 Open source projects that are alternatives of or similar to Espnet

OPUS-MT-train
Training open neural machine translation models
Stars: ✭ 166 (-96.34%)
Mutual labels:  machine-translation
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-98.37%)
Mutual labels:  speech-synthesis
speech-enhancement-WGAN
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
Stars: ✭ 35 (-99.23%)
Mutual labels:  speech-enhancement
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (-98.48%)
Mutual labels:  speech-recognition
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (-93.89%)
Mutual labels:  speech-recognition
SpleeterRT
Real time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (-97.55%)
Mutual labels:  speech-enhancement
speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-99.18%)
Mutual labels:  speech-recognition
kaldi-python-io
A python IO interface for data accessing in kaldi
Stars: ✭ 39 (-99.14%)
Mutual labels:  kaldi
download audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-98.83%)
Mutual labels:  speech-recognition
License-plate-recognition
使用 "Darknet yolov3-tiny" 进行车牌识别
Stars: ✭ 90 (-98.01%)
Mutual labels:  end-to-end
revai-java-sdk
Rev.ai Java SDK
Stars: ✭ 16 (-99.65%)
Mutual labels:  speech-recognition
gravity
User-space deniable data encryption client.
Stars: ✭ 89 (-98.04%)
Mutual labels:  end-to-end
ilmulti
Tooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-99.58%)
Mutual labels:  machine-translation
sepia-docs
Documentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (-96.47%)
Mutual labels:  speech-recognition
sova-tts-tps
NLP-preprocessor for the SOVA-TTS project
Stars: ✭ 44 (-99.03%)
Mutual labels:  speech-synthesis
ocaml-otr
Off-the-record (OTR) messaging protocol, purely in OCaml
Stars: ✭ 39 (-99.14%)
Mutual labels:  end-to-end
bergamot-translator
Cross platform C++ library focusing on optimized machine translation on the consumer-grade device.
Stars: ✭ 181 (-96.01%)
Mutual labels:  machine-translation
Tensorflow-Keyword-Spotting
Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-99.4%)
Mutual labels:  speech-recognition
superresolution gan
Chainer implementation of Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Stars: ✭ 50 (-98.9%)
Mutual labels:  chainer
ChainerPruner
ChainerPruner: Channel Pruning framework for Chainer
Stars: ✭ 21 (-99.54%)
Mutual labels:  chainer
speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-99.12%)
Mutual labels:  end-to-end
NanoFlow
PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
Stars: ✭ 63 (-98.61%)
Mutual labels:  speech-synthesis
Attention-Visualization
Visualization for simple attention and Google's multi-head attention.
Stars: ✭ 54 (-98.81%)
Mutual labels:  machine-translation
captioning chainer
A fast implementation of Neural Image Caption by Chainer
Stars: ✭ 17 (-99.62%)
Mutual labels:  chainer
subtitles-view
基于javaFX的简单字幕处理桌面程序,集成在线翻译及语音转换
Stars: ✭ 368 (-91.88%)
Mutual labels:  voice-conversion
QPPWG
Quasi-Periodic Parallel WaveGAN Pytorch implementation
Stars: ✭ 41 (-99.1%)
Mutual labels:  speech-synthesis
chainer-notebooks
Jupyter notebooks for Chainer hands-on
Stars: ✭ 23 (-99.49%)
Mutual labels:  chainer
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-96.05%)
Mutual labels:  speech-recognition
Zhihu
This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
Stars: ✭ 3,307 (-27.05%)
Mutual labels:  machine-translation
asr24
24-hour Automatic Speech Recognition
Stars: ✭ 27 (-99.4%)
Mutual labels:  kaldi
vcc20 baseline cyclevae
Voice Conversion Challenge 2020 CycleVAE baseline system
Stars: ✭ 123 (-97.29%)
Mutual labels:  voice-conversion
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-95.48%)
Mutual labels:  speech-recognition
htk
HTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-99.69%)
Mutual labels:  speech-recognition
wiki2ssml
Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.
Stars: ✭ 31 (-99.32%)
Mutual labels:  speech-synthesis
deep-learning-platforms
deep-learning platforms,framework,data(深度学习平台、框架、资料)
Stars: ✭ 17 (-99.62%)
Mutual labels:  chainer
good-speech-web-client
Practice your speech level in any language using speech recognition
Stars: ✭ 26 (-99.43%)
Mutual labels:  speech-recognition
Pytorchwavenetvocoder
WaveNet-Vocoder implementation with pytorch.
Stars: ✭ 269 (-94.07%)
Mutual labels:  speech-synthesis
obvi
A Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-98.81%)
Mutual labels:  speech-recognition
JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-99.56%)
Mutual labels:  voice-conversion
sova-tts-engine
Tacotron2 based engine for the SOVA-TTS project
Stars: ✭ 63 (-98.61%)
Mutual labels:  speech-synthesis
sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (-99.01%)
Mutual labels:  speech-recognition
quickstart-examples
Integration examples of Tanker's client-side encryption SDKs
Stars: ✭ 17 (-99.62%)
Mutual labels:  end-to-end
Calculate-SNR-SDR
Script to calculate SNR and SDR using python
Stars: ✭ 76 (-98.32%)
Mutual labels:  speech-separation
Hifi Gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-92.83%)
Mutual labels:  speech-synthesis
GlottDNN
GlottDNN vocoder and tools for training DNN excitation models
Stars: ✭ 30 (-99.34%)
Mutual labels:  speech-synthesis
revai-node-sdk
Node.js SDK for the Rev AI API
Stars: ✭ 21 (-99.54%)
Mutual labels:  speech-recognition
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (-93.49%)
Mutual labels:  speech-synthesis
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-98.39%)
Mutual labels:  speech-synthesis
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-99.54%)
Mutual labels:  speech-recognition
char-rnn-text-generation
Character Embeddings Recurrent Neural Network Text Generation Models
Stars: ✭ 64 (-98.59%)
Mutual labels:  chainer
Portrait matting
Implementation of "Automatic Portrait Segmentation" and "Deep Automatic Portrait Matting" with Chainer.
Stars: ✭ 267 (-94.11%)
Mutual labels:  chainer
fdndlp
A speech dereverberation algorithm, also called wpe
Stars: ✭ 115 (-97.46%)
Mutual labels:  speech-enhancement
LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Stars: ✭ 67 (-98.52%)
Mutual labels:  speech-synthesis
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-98.43%)
Mutual labels:  speech-recognition
Sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (-93.54%)
Mutual labels:  machine-translation
inv rl
Inverse Reinforcement Learning Argorithms
Stars: ✭ 34 (-99.25%)
Mutual labels:  chainer
apertium-html-tools
Web application providing a fully localised interface for text/website/document translation, analysis and generation powered by Apertium.
Stars: ✭ 36 (-99.21%)
Mutual labels:  machine-translation
Phomeme
Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-99.6%)
Mutual labels:  voice-conversion
neural style synthesizer
No description or website provided.
Stars: ✭ 15 (-99.67%)
Mutual labels:  chainer
VoiceBridge
VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-99.62%)
Mutual labels:  speech-recognition
301-360 of 717 similar projects