All Projects → Awesome Speech Recognition Speech Synthesis Papers → Similar Projects or Alternatives

1978 Open source projects that are alternatives of or similar to Awesome Speech Recognition Speech Synthesis Papers

voicekit-examples

Examples on how to use Tinkoff Voicekit

Stars: ✭ 35 (-98.32%)

Mutual labels: speech-synthesis, speech-recognition

Handwritingrecognitionsystem

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Stars: ✭ 262 (-87.43%)

Mutual labels: cnn, rnn

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-98.94%)

Mutual labels: tts, speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-86.62%)

Mutual labels: tts, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-96.45%)

Mutual labels: tts, speech-synthesis

Seq2seq chatbot

基于seq2seq模型的简单对话系统的tf实现，具有embedding、attention、beam_search等功能，数据集是Cornell Movie Dialogs

Stars: ✭ 308 (-85.23%)

Mutual labels: attention-mechanism, seq2seq

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (-85.37%)

Mutual labels: attention-mechanism, speech-recognition

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-84.41%)

Mutual labels: tts, speech-synthesis

Seq2seq Summarizer

Pointer-generator reinforced seq2seq summarization in PyTorch

Stars: ✭ 306 (-85.32%)

Mutual labels: attention-mechanism, seq2seq

Numpy neural network

仅使用numpy从头开始实现神经网络,包括反向传播公式推导过程; numpy构建全连接层、卷积层、池化层、Flatten层；以及图像分类案例及精调网络案例等,持续更新中... ...

Stars: ✭ 339 (-83.74%)

Mutual labels: cnn, dnn

Text Classification Cnn Rnn

CNN-RNN中文文本分类，基于TensorFlow

Stars: ✭ 3,613 (+73.29%)

Mutual labels: cnn, rnn

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (-83.02%)

Mutual labels: speech-synthesis, speech-recognition

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+310.55%)

Mutual labels: speech-synthesis, speech-recognition

Tacotron Wavernn

TTS (Tacotron + WaveRNN)

Stars: ✭ 40 (-98.08%)

Mutual labels: dnn, tts

Cortex M Kws

Cortex M KWS example with Tengine Lite.

Stars: ✭ 45 (-97.84%)

Mutual labels: cnn, speech-recognition

Keras Sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-97.75%)

Mutual labels: cnn, speech-recognition

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (-82.06%)

Mutual labels: language-model, speech-recognition

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-76.5%)

Mutual labels: speech-synthesis, speech-recognition

Transformer Tts

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Stars: ✭ 418 (-79.95%)

Mutual labels: attention-mechanism, tts

Char rnn lm zh

language model in Chinese，基于Pytorch官方文档实现

Stars: ✭ 57 (-97.27%)

Mutual labels: rnn, language-model

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+117.41%)

Mutual labels: speech-synthesis, speech-recognition

Multi Class Text Classification Cnn Rnn

Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.

Stars: ✭ 570 (-72.66%)

Mutual labels: cnn, rnn

Awesome Bert Nlp

A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.

Stars: ✭ 567 (-72.81%)

Mutual labels: attention-mechanism, language-model

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Stars: ✭ 1,699 (-18.51%)

Mutual labels: tts, speech-synthesis

How To Learn Deep Learning

A top-down, practical guide to learn AI, Deep learning and Machine Learning.

Stars: ✭ 544 (-73.91%)

Mutual labels: cnn, rnn

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (-63.36%)

Mutual labels: cnn, speech-recognition

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (-63.74%)

Mutual labels: language-model, speech-recognition

Seq2seq chatbot new

基于seq2seq模型的简单对话系统的tf实现，具有embedding、attention、beam_search等功能，数据集是Cornell Movie Dialogs

Stars: ✭ 144 (-93.09%)

Mutual labels: attention-mechanism, seq2seq

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-96.5%)

Mutual labels: tts, speech-synthesis

Rnn Theano

使用Theano实现的一些RNN代码，包括最基本的RNN，LSTM，以及部分Attention模型，如论文MLSTM等

Stars: ✭ 31 (-98.51%)

Mutual labels: cnn, rnn

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-98.51%)

Mutual labels: tts, speech-synthesis

Boilerplate Dynet Rnn Lm

Boilerplate code for quickly getting set up to run language modeling experiments

Stars: ✭ 37 (-98.23%)

Mutual labels: rnn, language-model

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-98.66%)

Mutual labels: tts, speech-synthesis

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+434.82%)

Mutual labels: speech-recognition, speaker-verification

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-98.08%)

Mutual labels: cnn, dnn

Py Nltools

A collection of basic python modules for spoken natural language processing

Stars: ✭ 46 (-97.79%)

Mutual labels: tts, speech-recognition

Ailearning

AiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP

Stars: ✭ 32,316 (+1449.93%)

Mutual labels: rnn, dnn

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-97.51%)

Mutual labels: tts, speech-synthesis

Mxnet Seq2seq

Sequence to sequence learning with MXNET

Stars: ✭ 51 (-97.55%)

Mutual labels: rnn, seq2seq

Deepseqslam

The Official Deep Learning Framework for Route-based Place Recognition

Stars: ✭ 49 (-97.65%)

Mutual labels: cnn, rnn

Speech ai

Simple speech linguistic AI with Python

Stars: ✭ 66 (-96.83%)

Mutual labels: speech-synthesis, speech-recognition

Nlp overview

Overview of Modern Deep Learning Techniques Applied to Natural Language Processing

Stars: ✭ 1,104 (-47.05%)

Mutual labels: cnn, rnn

Patter

speech-to-text in pytorch

Stars: ✭ 71 (-96.59%)

Mutual labels: rnn, speech-recognition

Sleepeegnet

SleepEEGNet: Automated Sleep Stage Scoring with Sequence to Sequence Deep Learning Approach

Stars: ✭ 89 (-95.73%)

Mutual labels: cnn, rnn

Attend infer repeat

A Tensorfflow implementation of Attend, Infer, Repeat

Stars: ✭ 82 (-96.07%)

Mutual labels: rnn, attention-mechanism

Speech Emotion Recognition

Detecting emotions using MFCC features of human speech using Deep Learning

Stars: ✭ 89 (-95.73%)

Mutual labels: rnn, speech-recognition

Cross vc

Cross-lingual Voice Conversion

Stars: ✭ 91 (-95.64%)

Mutual labels: speech-synthesis, speech-recognition

Attention Ocr

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Stars: ✭ 844 (-59.52%)

Mutual labels: cnn, seq2seq

Parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.

Stars: ✭ 48 (-97.7%)

Mutual labels: tts, speech-recognition

Cnn vocoder

A fast cnn-based vocoder

Stars: ✭ 74 (-96.45%)

Mutual labels: tts, speech-synthesis

Cnn lstm for text classify

CNN, LSTM, NBOW, fasttext 中文文本分类

Stars: ✭ 90 (-95.68%)

Mutual labels: cnn, rnn

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Stars: ✭ 2,382 (+14.24%)

Mutual labels: tts, speech-synthesis

Codesearchnet

Datasets, tools, and benchmarks for representation learning of code.

Stars: ✭ 1,378 (-33.91%)

Mutual labels: cnn, rnn

Kaggle Web Traffic

1st place solution

Stars: ✭ 1,641 (-21.29%)

Mutual labels: rnn, seq2seq

Captcharecognition

End-to-end variable length Captcha recognition using CNN+RNN+Attention/CTC (pytorch implementation). 端到端的不定长验证码识别