The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+469.77%)

Mutual labels: speech

Discordspeechbot

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

Stars: ✭ 35 (-59.3%)

Mutual labels: speech

Specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Stars: ✭ 408 (+374.42%)

Mutual labels: speech

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+254.65%)

Mutual labels: speech

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-33.72%)

Mutual labels: speech

Wavenet Stt

An end-to-end speech recognition system with Wavenet. Built using C++ and python.

Stars: ✭ 18 (-79.07%)

Mutual labels: wavenet

Inaspeechsegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (+309.3%)

Mutual labels: speech

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+518.6%)

Mutual labels: speech

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-53.49%)

Mutual labels: speech

Flowavenet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Stars: ✭ 471 (+447.67%)

Mutual labels: wavenet

Tf Wavenet vocoder

Wavenet and its applications with Tensorflow

Stars: ✭ 58 (-32.56%)

Mutual labels: wavenet

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+356.98%)

Mutual labels: speech

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-63.95%)

Mutual labels: speech

Pycadl

Python package with source code from the course "Creative Applications of Deep Learning w/ TensorFlow"

Stars: ✭ 356 (+313.95%)

Mutual labels: wavenet

Chainer Vq Vae

A Chainer implementation of VQ-VAE.

Stars: ✭ 77 (-10.47%)

Mutual labels: wavenet

Ios 10 Sampler

Code examples for new APIs of iOS 10.

Stars: ✭ 3,341 (+3784.88%)

Mutual labels: speech

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+7127.91%)

Mutual labels: speech

Css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Stars: ✭ 302 (+251.16%)

Mutual labels: speech

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+245.35%)

Mutual labels: speech

Soloud

Free, easy, portable audio engine for games

Stars: ✭ 1,048 (+1118.6%)

Mutual labels: speech

Praat

Praat: Doing Phonetics By Computer

Stars: ✭ 675 (+684.88%)

Mutual labels: speech

Clarinet

A Pytorch Implementation of ClariNet

Stars: ✭ 273 (+217.44%)

Mutual labels: wavenet

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (+1082.56%)

Mutual labels: speech

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+473.26%)

Mutual labels: speech

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-25.58%)

Mutual labels: speech

Xr3player

🎧 🎼 Advanced JavaFX Media Player

Stars: ✭ 472 (+448.84%)

Mutual labels: speech

Vq Vae Wavenet

TensorFlow implementation of VQ-VAE with WaveNet decoder, based on https://arxiv.org/abs/1711.00937 and https://arxiv.org/abs/1901.08810

Stars: ✭ 40 (-53.49%)

Mutual labels: wavenet

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+408.14%)

Mutual labels: speech

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+1317.44%)

Mutual labels: speech

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+374.42%)

Mutual labels: speech

Wsay

Windows "say"

Stars: ✭ 36 (-58.14%)

Mutual labels: speech

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (+346.51%)

Mutual labels: speech

Sound Source Localization Algorithm doa estimation

关于语音信号声源定位DOA估计所用的一些传统算法

Stars: ✭ 58 (-32.56%)

Mutual labels: speech

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+320.93%)

Mutual labels: speech

Pytorch Uniwavenet

Stars: ✭ 30 (-65.12%)

Mutual labels: wavenet

Time Series Prediction

A collection of time series prediction methods: rnn, seq2seq, cnn, wavenet, transformer, unet, n-beats, gan, kalman-filter

Stars: ✭ 351 (+308.14%)

Mutual labels: wavenet

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (+1362.79%)

Mutual labels: speech

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+6210.47%)

Mutual labels: speech

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+779.07%)

Mutual labels: speech

Android Speech

Android speech recognition and text to speech made easy

Stars: ✭ 310 (+260.47%)

Mutual labels: speech

Wavenet

WaveNet implementation with chainer

Stars: ✭ 53 (-38.37%)

Mutual labels: wavenet

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (+246.51%)

Mutual labels: speech

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+693.02%)

Mutual labels: wavenet

Sednn

deep learning based speech enhancement using keras or pytorch, make it easy to use

Stars: ✭ 288 (+234.88%)

Mutual labels: speech

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-19.77%)

Mutual labels: speech

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+668.6%)

Mutual labels: speech

Pytorchwavenetvocoder

WaveNet-Vocoder implementation with pytorch.

Stars: ✭ 269 (+212.79%)

Mutual labels: wavenet

Speech Vad Demo

集成Webrtc的VAD，用于切分音频文件

Stars: ✭ 259 (+201.16%)

Mutual labels: speech

Speech Aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (+201.16%)

Mutual labels: speech

Tacotron2

pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf

Stars: ✭ 46 (-46.51%)

Mutual labels: wavenet

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+636.05%)

Mutual labels: speech

Amazing Python Scripts

🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

Stars: ✭ 229 (+166.28%)

Mutual labels: speech

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-43.02%)

Mutual labels: speech

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+623.26%)

Mutual labels: speech

Audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Stars: ✭ 1,262 (+1367.44%)

Mutual labels: speech

1-60 of 212 similar projects

›