All Projects → Awesome Speech Recognition Speech Synthesis Papers → Similar Projects or Alternatives

1978 Open source projects that are alternatives of or similar to Awesome Speech Recognition Speech Synthesis Papers

voicekit-examples
Examples on how to use Tinkoff Voicekit
Stars: ✭ 35 (-98.32%)
Handwritingrecognitionsystem
Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture
Stars: ✭ 262 (-87.43%)
Mutual labels:  cnn, rnn
Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: ✭ 22 (-98.94%)
Mutual labels:  tts, speech-synthesis
Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-86.62%)
Mutual labels:  tts, speech-synthesis
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-96.45%)
Mutual labels:  tts, speech-synthesis
Seq2seq chatbot
基于seq2seq模型的简单对话系统的tf实现,具有embedding、attention、beam_search等功能,数据集是Cornell Movie Dialogs
Stars: ✭ 308 (-85.23%)
Mutual labels:  attention-mechanism, seq2seq
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (-85.37%)
Hifi Gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-84.41%)
Mutual labels:  tts, speech-synthesis
Seq2seq Summarizer
Pointer-generator reinforced seq2seq summarization in PyTorch
Stars: ✭ 306 (-85.32%)
Mutual labels:  attention-mechanism, seq2seq
Numpy neural network
仅使用numpy从头开始实现神经网络,包括反向传播公式推导过程; numpy构建全连接层、卷积层、池化层、Flatten层;以及图像分类案例及精调网络案例等,持续更新中... ...
Stars: ✭ 339 (-83.74%)
Mutual labels:  cnn, dnn
Text Classification Cnn Rnn
CNN-RNN中文文本分类,基于TensorFlow
Stars: ✭ 3,613 (+73.29%)
Mutual labels:  cnn, rnn
Libfaceid
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-83.02%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+310.55%)
Tacotron Wavernn
TTS (Tacotron + WaveRNN)
Stars: ✭ 40 (-98.08%)
Mutual labels:  dnn, tts
Cortex M Kws
Cortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-97.84%)
Mutual labels:  cnn, speech-recognition
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-97.75%)
Mutual labels:  cnn, speech-recognition
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-82.06%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-76.5%)
Transformer Tts
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
Stars: ✭ 418 (-79.95%)
Mutual labels:  attention-mechanism, tts
Char rnn lm zh
language model in Chinese,基于Pytorch官方文档实现
Stars: ✭ 57 (-97.27%)
Mutual labels:  rnn, language-model
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+117.41%)
Multi Class Text Classification Cnn Rnn
Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.
Stars: ✭ 570 (-72.66%)
Mutual labels:  cnn, rnn
Awesome Bert Nlp
A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.
Stars: ✭ 567 (-72.81%)
Marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Stars: ✭ 1,699 (-18.51%)
Mutual labels:  tts, speech-synthesis
How To Learn Deep Learning
A top-down, practical guide to learn AI, Deep learning and Machine Learning.
Stars: ✭ 544 (-73.91%)
Mutual labels:  cnn, rnn
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (-63.36%)
Mutual labels:  cnn, speech-recognition
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (-63.74%)
Seq2seq chatbot new
基于seq2seq模型的简单对话系统的tf实现,具有embedding、attention、beam_search等功能,数据集是Cornell Movie Dialogs
Stars: ✭ 144 (-93.09%)
Mutual labels:  attention-mechanism, seq2seq
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-96.5%)
Mutual labels:  tts, speech-synthesis
Rnn Theano
使用Theano实现的一些RNN代码,包括最基本的RNN,LSTM,以及部分Attention模型,如论文MLSTM等
Stars: ✭ 31 (-98.51%)
Mutual labels:  cnn, rnn
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-98.51%)
Mutual labels:  tts, speech-synthesis
Boilerplate Dynet Rnn Lm
Boilerplate code for quickly getting set up to run language modeling experiments
Stars: ✭ 37 (-98.23%)
Mutual labels:  rnn, language-model
Jsut Lab
HTS-style full-context labels for JSUT v1.1
Stars: ✭ 28 (-98.66%)
Mutual labels:  tts, speech-synthesis
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+434.82%)
Dialectid e2e
End to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-98.08%)
Mutual labels:  cnn, dnn
Py Nltools
A collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-97.79%)
Mutual labels:  tts, speech-recognition
Ailearning
AiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP
Stars: ✭ 32,316 (+1449.93%)
Mutual labels:  rnn, dnn
Cs224n Gpu That Talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-97.51%)
Mutual labels:  tts, speech-synthesis
Mxnet Seq2seq
Sequence to sequence learning with MXNET
Stars: ✭ 51 (-97.55%)
Mutual labels:  rnn, seq2seq
Deepseqslam
The Official Deep Learning Framework for Route-based Place Recognition
Stars: ✭ 49 (-97.65%)
Mutual labels:  cnn, rnn
Speech ai
Simple speech linguistic AI with Python
Stars: ✭ 66 (-96.83%)
Nlp overview
Overview of Modern Deep Learning Techniques Applied to Natural Language Processing
Stars: ✭ 1,104 (-47.05%)
Mutual labels:  cnn, rnn
Patter
speech-to-text in pytorch
Stars: ✭ 71 (-96.59%)
Mutual labels:  rnn, speech-recognition
Sleepeegnet
SleepEEGNet: Automated Sleep Stage Scoring with Sequence to Sequence Deep Learning Approach
Stars: ✭ 89 (-95.73%)
Mutual labels:  cnn, rnn
Attend infer repeat
A Tensorfflow implementation of Attend, Infer, Repeat
Stars: ✭ 82 (-96.07%)
Mutual labels:  rnn, attention-mechanism
Speech Emotion Recognition
Detecting emotions using MFCC features of human speech using Deep Learning
Stars: ✭ 89 (-95.73%)
Mutual labels:  rnn, speech-recognition
Cross vc
Cross-lingual Voice Conversion
Stars: ✭ 91 (-95.64%)
Attention Ocr
A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
Stars: ✭ 844 (-59.52%)
Mutual labels:  cnn, seq2seq
Parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-97.7%)
Mutual labels:  tts, speech-recognition
Cnn vocoder
A fast cnn-based vocoder
Stars: ✭ 74 (-96.45%)
Mutual labels:  tts, speech-synthesis
Cnn lstm for text classify
CNN, LSTM, NBOW, fasttext 中文文本分类
Stars: ✭ 90 (-95.68%)
Mutual labels:  cnn, rnn
Tensorflowtts
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+14.24%)
Mutual labels:  tts, speech-synthesis
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-33.91%)
Mutual labels:  cnn, rnn
Kaggle Web Traffic
1st place solution
Stars: ✭ 1,641 (-21.29%)
Mutual labels:  rnn, seq2seq
Captcharecognition
End-to-end variable length Captcha recognition using CNN+RNN+Attention/CTC (pytorch implementation). 端到端的不定长验证码识别
Stars: ✭ 97 (-95.35%)
Mutual labels:  cnn, rnn
Tacotron Pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (-95.01%)
Mutual labels:  seq2seq, speech-synthesis
Adnet
Attention-guided CNN for image denoising(Neural Networks,2020)
Stars: ✭ 135 (-93.53%)
Mutual labels:  cnn, attention-mechanism
Wavernn
WaveRNN Vocoder + TTS
Stars: ✭ 1,636 (-21.53%)
Mutual labels:  tts, speech-synthesis
Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (-94.82%)
Mutual labels:  tts, speech-synthesis
Pytorch Learners Tutorial
PyTorch tutorial for learners
Stars: ✭ 97 (-95.35%)
Mutual labels:  cnn, rnn
61-120 of 1978 similar projects