All Projects → Nemo → Similar Projects or Alternatives

6723 Open source projects that are alternatives of or similar to Nemo

Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-96.39%)
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (-62.61%)
Lingvo
Lingvo
Stars: ✭ 2,361 (-35.93%)
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (-77.18%)
Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-97.2%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-99.27%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+132.29%)
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-99.05%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-98.56%)
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-98.64%)
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-95.36%)
Speech And Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-97.23%)
Kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (-59.05%)
Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (-93.35%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-86.7%)
Pytorch Dc Tts
Text to Speech with PyTorch (English and Mongolian)
Stars: ✭ 122 (-96.69%)
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-99.51%)
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-98.59%)
Hey Jetson
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (-95.63%)
Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (-93.13%)
Interspeech2019 Tutorial
INTERSPEECH 2019 Tutorial Materials
Stars: ✭ 160 (-95.66%)
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (-72.56%)
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+23.01%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-85.83%)
Parallelwavegan
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (-81.49%)
Nmtpytorch
Sequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-89.36%)
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (-69.61%)
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-99.48%)
Cs224n Gpu That Talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-98.59%)
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-96.04%)
Tacotron Pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (-97.18%)
Mutual labels:  text-to-speech, speech-synthesis
Wav2letter.pytorch
A fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (-97.18%)
Wavernn
WaveRNN Vocoder + TTS
Stars: ✭ 1,636 (-55.6%)
Mutual labels:  text-to-speech, speech-synthesis
Self Supervised Speech Recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
Stars: ✭ 106 (-97.12%)
Cross Lingual Voice Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
Stars: ✭ 106 (-97.12%)
Mutual labels:  jupyter-notebook, text-to-speech
Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (-97.07%)
Mutual labels:  text-to-speech, speech-synthesis
Bertqa Attention On Steroids
BertQA - Attention on Steroids
Stars: ✭ 112 (-96.96%)
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-96.99%)
Mutual labels:  text-to-speech, speech-synthesis
Go Astibob
Golang framework to build an AI that can understand and speak back to you, and everything else you want
Stars: ✭ 222 (-93.98%)
Mutual labels:  text-to-speech, speech-to-text
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-96.8%)
Nlp Models Tensorflow
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Stars: ✭ 1,603 (-56.5%)
Mutual labels:  jupyter-notebook, speech-to-text
Nlp Pretrained Model
A collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-96.69%)
Tensorflow Ctc Speech Recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (-96.55%)
Marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Stars: ✭ 1,699 (-53.89%)
Mutual labels:  text-to-speech, speech-synthesis
Alan Sdk Pcf
Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Stars: ✭ 128 (-96.53%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+202.61%)
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-96.53%)
Seq2seq tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-96.42%)
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-96.23%)
Mutual labels:  text-to-speech, speech-synthesis
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (-96.28%)
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (-96.26%)
Mutual labels:  text-to-speech, speech-synthesis
Tensorflowtts
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (-35.36%)
Mutual labels:  text-to-speech, speech-synthesis
Subword Nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Stars: ✭ 1,819 (-50.64%)
Mutual labels:  machine-translation, nmt
Dla
Deep learning for audio processing
Stars: ✭ 142 (-96.15%)
Speechrecognizerbutton
UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (-96.09%)
Pytorch Question Answering
Important paper implementations for Question Answering using PyTorch
Stars: ✭ 154 (-95.82%)
Natural Language Processing Specialization
This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera
Stars: ✭ 151 (-95.9%)
Tacotron 2
DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (-46.59%)
Mutual labels:  text-to-speech, speech-synthesis
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (-95.9%)
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (-95.71%)
Mutual labels:  text-to-speech, speech-synthesis
1-60 of 6723 similar projects