All Projects → Espnet → Similar Projects or Alternatives

717 Open source projects that are alternatives of or similar to Espnet

kosr
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-99.45%)
Mutual labels:  end-to-end, speech-recognition
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (-70.06%)
Mutual labels:  speech-recognition, kaldi
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (-96.14%)
Mutual labels:  speech-recognition, end-to-end
Kaldiio
A pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (-96.47%)
Mutual labels:  speech-recognition, kaldi
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (-69.6%)
kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (-89.94%)
Mutual labels:  end-to-end, speech-recognition
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-99.23%)
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (-95.68%)
Mutual labels:  speech-recognition, kaldi
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-95.06%)
NLP Toolkit
Library of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-97.97%)
Speech-Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-99.54%)
Mutual labels:  end-to-end, speech-recognition
End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
Stars: ✭ 20 (-99.56%)
Mutual labels:  end-to-end, speech-recognition
TinyCog
Small Robot, Toy Robot platform
Stars: ✭ 29 (-99.36%)
Speech ai
Simple speech linguistic AI with Python
Stars: ✭ 66 (-98.54%)
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-94.02%)
Mutual labels:  speech-recognition, kaldi
Tacotron Pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (-97.71%)
Mutual labels:  speech-synthesis, end-to-end
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-91.75%)
Mutual labels:  speech-recognition, kaldi
Wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+30.31%)
Mutual labels:  speech-recognition, end-to-end
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-88.04%)
vosk-model-ru-adaptation
No description or website provided.
Stars: ✭ 19 (-99.58%)
Mutual labels:  speech-recognition, kaldi
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (-54%)
Deep Learning Drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+114.36%)
Speech Transformer Tf2.0
transformer for ASR-systerm (via tensorflow2.0)
Stars: ✭ 90 (-98.01%)
Mutual labels:  speech-recognition, end-to-end
E2e Asr
PyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-97.66%)
Mutual labels:  speech-recognition, end-to-end
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-99.6%)
ppg-vc
PPG-Based Voice Conversion
Stars: ✭ 154 (-96.6%)
DSTC6-End-to-End-Conversation-Modeling
DSTC6: End-to-End Conversation Modeling Track
Stars: ✭ 56 (-98.76%)
Mutual labels:  chainer, end-to-end
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Stars: ✭ 50 (-98.9%)
Mutual labels:  end-to-end, speech-recognition
speech separation
Constrained Permutation Invariant Training, Speech Separation
Stars: ✭ 27 (-99.4%)
Mutual labels:  speech-separation
Wire Ios
📱 Wire for iOS (iPhone and iPad)
Stars: ✭ 3,079 (-32.08%)
Mutual labels:  end-to-end
waifu2x-chainer
Chainer implementation of waifu2x
Stars: ✭ 137 (-96.98%)
Mutual labels:  chainer
kim-voice-assistant
Kim,你的私人语音助理。
Stars: ✭ 70 (-98.46%)
Mutual labels:  speech-recognition
Bytenet Tensorflow
ByteNet for character-level language modelling
Stars: ✭ 319 (-92.96%)
Mutual labels:  machine-translation
React Transcript Editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Stars: ✭ 285 (-93.71%)
Mutual labels:  kaldi
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-97.29%)
Mutual labels:  speech-recognition
Voice-Denoising-AN
A Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (-99.07%)
Mutual labels:  speech-enhancement
Alan Sdk Android
Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (-93.87%)
Mutual labels:  speech-recognition
Ajax-Chat
Ajax Chat is a complete web chat in javascript, ajax, php and mysql compatible with Phonegap
Stars: ✭ 19 (-99.58%)
Mutual labels:  end-to-end
Espeak
eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.
Stars: ✭ 339 (-92.52%)
Mutual labels:  speech-synthesis
Gp Gan
Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)
Stars: ✭ 317 (-93.01%)
Mutual labels:  chainer
nepali-translator
Neural Machine Translation on the Nepali-English language pair
Stars: ✭ 29 (-99.36%)
Mutual labels:  machine-translation
Recording-Bot
A bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-99.51%)
Mutual labels:  speech-recognition
StageMate
StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-98.68%)
Mutual labels:  speech-recognition
Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-93.85%)
Mutual labels:  speech-synthesis
Tacotron pytorch
Tacotron implementation of pytorch
Stars: ✭ 12 (-99.74%)
Mutual labels:  speech-synthesis
Alan Sdk Flutter
Alan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (-93.18%)
Mutual labels:  speech-recognition
Neuraldialog Cvae
Tensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 279 (-93.85%)
Mutual labels:  end-to-end
dropclass speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-99.56%)
Mutual labels:  kaldi
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-98.37%)
Mutual labels:  speech-synthesis
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (-93.89%)
Mutual labels:  speech-recognition
Multi-Hotword Spotting
Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (-99.32%)
Mutual labels:  speech-recognition
download audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-98.83%)
Mutual labels:  speech-recognition
Multilingual text to speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (-92.85%)
Mutual labels:  speech-synthesis
Cognitive Speech Tts
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-93.12%)
Mutual labels:  speech-synthesis
Transformer
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
Stars: ✭ 271 (-94.02%)
Mutual labels:  machine-translation
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-99.58%)
Mutual labels:  speech-recognition
ocaml-otr
Off-the-record (OTR) messaging protocol, purely in OCaml
Stars: ✭ 39 (-99.14%)
Mutual labels:  end-to-end
Attention-Visualization
Visualization for simple attention and Google's multi-head attention.
Stars: ✭ 54 (-98.81%)
Mutual labels:  machine-translation
Zhihu
This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
Stars: ✭ 3,307 (-27.05%)
Mutual labels:  machine-translation
Alan Sdk Cordova
Alan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (-94.07%)
Mutual labels:  speech-recognition
61-120 of 717 similar projects