All Projects → Tacotron2-PyTorch → Similar Projects or Alternatives

391 Open source projects that are alternatives of or similar to Tacotron2-PyTorch

Merlin
This is now the official location of the Merlin project.
Stars: ✭ 1,168 (+889.83%)
Mutual labels:  text-to-speech
Skip Thoughts.torch
Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7
Stars: ✭ 146 (+23.73%)
Mutual labels:  pretrained-models
mlp-singer
Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
Stars: ✭ 103 (-12.71%)
Mutual labels:  text-to-speech
Tod Bert
Pre-Trained Models for ToD-BERT
Stars: ✭ 143 (+21.19%)
Mutual labels:  pretrained-models
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+849.15%)
Mutual labels:  text-to-speech
Nnsplit
Semantic text segmentation. For sentence boundary detection, compound splitting and more.
Stars: ✭ 141 (+19.49%)
Mutual labels:  pretrained-models
Breast density classifier
Breast density classification with deep convolutional neural networks
Stars: ✭ 137 (+16.1%)
Mutual labels:  pretrained-models
Electra
中文 预训练 ELECTRA 模型: 基于对抗学习 pretrain Chinese Model
Stars: ✭ 132 (+11.86%)
Mutual labels:  pretrained-models
ganimation replicate
An Out-of-the-Box Replication of GANimation using PyTorch, pretrained weights are available!
Stars: ✭ 165 (+39.83%)
Mutual labels:  pretrained-models
arguing-robots
🤖 Watch and hear macOS robots argue live in your terminal 🤖
Stars: ✭ 53 (-55.08%)
Mutual labels:  text-to-speech
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (+33.9%)
Mutual labels:  text-to-speech
Mrcp Plugin With Freeswitch
使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
Stars: ✭ 168 (+42.37%)
Mutual labels:  tts
Covid Twitter Bert
Pretrained BERT model for analysing COVID-19 Twitter data
Stars: ✭ 101 (-14.41%)
Mutual labels:  pretrained-models
Friend.ly
A social media platform with a friend recommendation engine based on personality trait extraction
Stars: ✭ 41 (-65.25%)
Mutual labels:  text-to-speech
gap-text2sql
GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training
Stars: ✭ 83 (-29.66%)
Mutual labels:  pretrained-models
Melnet
Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
Stars: ✭ 161 (+36.44%)
Mutual labels:  tts
Tts Papers
🐸 collection of TTS papers
Stars: ✭ 160 (+35.59%)
Mutual labels:  tts
Dialogue Understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Stars: ✭ 77 (-34.75%)
Mutual labels:  pretrained-models
Asrgen
Attacking Speaker Recognition with Deep Generative Models
Stars: ✭ 31 (-73.73%)
Mutual labels:  text-to-speech
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-82.2%)
Mutual labels:  pretrained-models
Gpt2 Ml
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Stars: ✭ 1,066 (+803.39%)
Mutual labels:  pretrained-models
Pyannote Audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Stars: ✭ 978 (+728.81%)
Mutual labels:  pretrained-models
basecls
A codebase & model zoo for pretrained backbone based on MegEngine.
Stars: ✭ 29 (-75.42%)
Mutual labels:  pretrained-models
Classification models
Classification models trained on ImageNet. Keras.
Stars: ✭ 938 (+694.92%)
Mutual labels:  pretrained-models
Vonage Php Sdk Core
Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 849 (+619.49%)
Mutual labels:  text-to-speech
melgan
MelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (-54.24%)
Mutual labels:  text-to-speech
Automatic Youtube Reddit Text To Speech Video Generator And Uploader
A series of 3 programs that will automatically receive scripts from Reddit, allow the user to edit them, then be sent off to a video generator where they will be uploaded to YouTube automatically.
Stars: ✭ 152 (+28.81%)
Mutual labels:  tts
Bert Keras
Keras implementation of BERT with pre-trained weights
Stars: ✭ 820 (+594.92%)
Mutual labels:  pretrained-models
Nonparaseq2seqvc code
Implementation code of non-parallel sequence-to-sequence VC
Stars: ✭ 154 (+30.51%)
Mutual labels:  text-to-speech
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (+1666.95%)
Mutual labels:  tts
Pytorchinsight
a pytorch lib with state-of-the-art architectures, pretrained models and real-time updated results
Stars: ✭ 713 (+504.24%)
Mutual labels:  pretrained-models
node-red-contrib-yandex-station-management
Модуль node-red-contrib-yandex-station-management для управления умными колонками от Яндекс
Stars: ✭ 20 (-83.05%)
Mutual labels:  tts
Conv Emotion
This repo contains implementation of different architectures for emotion recognition in conversations.
Stars: ✭ 646 (+447.46%)
Mutual labels:  pretrained-models
cookietts
TTS from Cookie. Messy and experimental!
Stars: ✭ 29 (-75.42%)
Mutual labels:  tacotron2
Tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+1388.14%)
Mutual labels:  tts
Dla
Deep learning for audio processing
Stars: ✭ 142 (+20.34%)
Mutual labels:  tts
Pinto model zoo
A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]
Stars: ✭ 634 (+437.29%)
Mutual labels:  pretrained-models
Breast cancer classifier
Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening
Stars: ✭ 614 (+420.34%)
Mutual labels:  pretrained-models
Self Driving Car In Video Games
A deep neural network that learns to drive in video games
Stars: ✭ 559 (+373.73%)
Mutual labels:  pretrained-models
masr
中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Stars: ✭ 179 (+51.69%)
Mutual labels:  pretrained-models
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+342.37%)
Mutual labels:  pretrained-models
Ha Tts Bluetooth Speaker
TTS Bluetooth Speaker for Home Assistant
Stars: ✭ 140 (+18.64%)
Mutual labels:  tts
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+1301.69%)
Mutual labels:  tts
Bert Multitask Learning
BERT for Multitask Learning
Stars: ✭ 380 (+222.03%)
Mutual labels:  pretrained-models
Google Speech V2
💬 Reverse Engineering Google's Speech To Text API (v2)
Stars: ✭ 435 (+268.64%)
Mutual labels:  text-to-speech
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-70.34%)
Mutual labels:  text-to-speech
Midi2voice
Singing synthesis from MIDI file
Stars: ✭ 102 (-13.56%)
Mutual labels:  tts
My Appdaemon
My apps, my helpfiles, all about AppDaemon for Home Assistant
Stars: ✭ 94 (-20.34%)
Mutual labels:  tts
Pytorch Human Pose Estimation
Implementation of various human pose estimation models in pytorch on multiple datasets (MPII & COCO) along with pretrained models
Stars: ✭ 346 (+193.22%)
Mutual labels:  pretrained-models
Alan Sdk Web
Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (+211.86%)
Mutual labels:  text-to-speech
Pitchtron
TTS for pitch-accented language. Korean dialect DB.
Stars: ✭ 91 (-22.88%)
Mutual labels:  tts
Tts
Tools to convert text to speech 📚💬
Stars: ✭ 84 (-28.81%)
Mutual labels:  tts
concurrent-video-analytic-pipeline-optimization-sample-l
Create a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.
Stars: ✭ 39 (-66.95%)
Mutual labels:  pretrained-models
One-Shot-Voice-Cloning
☺️ One Shot Voice Cloning base on Unet-TTS
Stars: ✭ 118 (+0%)
Mutual labels:  tts
WARP
Code for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming. Outperforming `GPT-3` on SuperGLUE Few-Shot text classification. https://aclanthology.org/2021.acl-long.381/
Stars: ✭ 66 (-44.07%)
Mutual labels:  pretrained-models
open clip
An open source implementation of CLIP.
Stars: ✭ 1,534 (+1200%)
Mutual labels:  pretrained-models
bangla-tts
Bangla text to speech, Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library
Stars: ✭ 61 (-48.31%)
Mutual labels:  text-to-speech
java-google-speech-api
🙊 Speech Recognition , Text To Speech , Google Translate
Stars: ✭ 67 (-43.22%)
Mutual labels:  text-to-speech
Hassio Addons
The repository for my Home Assistant Supervisor Add-ons.
Stars: ✭ 71 (-39.83%)
Mutual labels:  tts
Cnn vocoder
A fast cnn-based vocoder
Stars: ✭ 74 (-37.29%)
Mutual labels:  tts
301-360 of 391 similar projects