End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-96.98%)

Mutual labels: speech-synthesis

Espeak Ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Stars: ✭ 799 (-51.69%)

Mutual labels: speech-synthesis

Flutter tts

Flutter Text to Speech package

Stars: ✭ 263 (-84.1%)

Mutual labels: tts

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Stars: ✭ 60 (-96.37%)

Mutual labels: speech-synthesis

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-93.05%)

Mutual labels: speech-processing

Prosody

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Stars: ✭ 139 (-91.6%)

Mutual labels: speech-synthesis

py-espeak-ng

Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible.

Stars: ✭ 27 (-98.37%)

Mutual labels: tts

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-91.6%)

Mutual labels: speech-synthesis

Real Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Stars: ✭ 32,095 (+1840.45%)

Mutual labels: tts

Xva Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Stars: ✭ 136 (-91.78%)

Mutual labels: speech-synthesis

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 6,026 (+264.33%)

Mutual labels: end-to-end

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-91.96%)

Mutual labels: speech-synthesis

Pink Trombone

A programmable version of Neil Thapen's Pink Trombone

Stars: ✭ 54 (-96.74%)

Mutual labels: speech-synthesis

Awesome Speech Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Stars: ✭ 257 (-84.46%)

Mutual labels: speech-processing

magphase

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Stars: ✭ 76 (-95.41%)

Mutual labels: tts

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (-97.22%)

Mutual labels: speech-synthesis

Vq Vae Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Stars: ✭ 187 (-88.69%)

Mutual labels: speech-processing

Joytan

Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel

Stars: ✭ 91 (-94.5%)

Mutual labels: tts

Sdk Js

Tanker client-side encryption SDK for JavaScript

Stars: ✭ 786 (-52.48%)

Mutual labels: end-to-end

SpeechTransProgress

Tracking the progress in end-to-end speech translation

Stars: ✭ 139 (-91.6%)

Mutual labels: speech-processing

FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊

Stars: ✭ 64 (-96.13%)

Mutual labels: tts

Tutorial separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

Stars: ✭ 151 (-90.87%)

Mutual labels: speech-processing

pygtrans

谷歌翻译, 支持 APIKEY 一口气翻译十万条

Stars: ✭ 60 (-96.37%)

Mutual labels: tts

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-91.17%)

Mutual labels: speech-processing

Inspectit

inspectIT is the leading Open Source APM (Application Performance Management) tool for analyzing your Java (EE) applications.

Stars: ✭ 513 (-68.98%)

Mutual labels: end-to-end

A Convolutional Recurrent Neural Network For Real Time Speech Enhancement

A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch

Stars: ✭ 123 (-92.56%)

Mutual labels: speech-processing

DiscordEncryption

🔐 Configurable end to end encryption for Discord

Stars: ✭ 30 (-98.19%)

Mutual labels: end-to-end

Sstd

Single Shot Text Detector with Regional Attention

Stars: ✭ 221 (-86.64%)

Mutual labels: end-to-end

Fullsubnet

PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Stars: ✭ 51 (-96.92%)

Mutual labels: speech-processing

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (-88.51%)

Mutual labels: end-to-end

text-to-speech

⚡️ Capacitor plugin for synthesizing speech from text.

Stars: ✭ 50 (-96.98%)

Mutual labels: tts

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-89.42%)

Mutual labels: end-to-end

Lanedetection end2end

End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)

Stars: ✭ 500 (-69.77%)

Mutual labels: end-to-end

Listen Attend Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Stars: ✭ 147 (-91.11%)

Mutual labels: end-to-end

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (-96.98%)

Mutual labels: end-to-end

Mtts

A Demo of Mandarin/Chinese TTS frontend

Stars: ✭ 229 (-86.15%)

Mutual labels: tts

Wave U Net For Speech Enhancement

Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.

Stars: ✭ 106 (-93.59%)

Mutual labels: speech-processing

Multi Tacotron Voice Cloning

Phoneme multilingual(Russian-English) voice cloning based on

Stars: ✭ 192 (-88.39%)

Mutual labels: tts

shairport-sync

AirPlay audio player. Shairport Sync adds multi-room capability with Audio Synchronisation

Stars: ✭ 5,532 (+234.46%)

Mutual labels: multi-speaker

Speaker adapted tts

Making a TTS model with 1 minute of speech samples within 10 minutes

Stars: ✭ 183 (-88.94%)

Mutual labels: tts

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-70.37%)

Mutual labels: speech-synthesis

Gst Tacotron

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Stars: ✭ 175 (-89.42%)

Mutual labels: tts

cypress-example-docker-circle-workflows

Cypress + Docker + CircleCI Workflows = ❤️

Stars: ✭ 29 (-98.25%)

Mutual labels: end-to-end

Melnet

Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"

Stars: ✭ 161 (-90.27%)

Mutual labels: tts

Keras Sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-97.16%)

Mutual labels: speech-processing

mimic2

Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.

Stars: ✭ 537 (-67.53%)

Mutual labels: speech-synthesis

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (-92.87%)

Mutual labels: tts

Tf Kaldi Speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Stars: ✭ 117 (-92.93%)

Mutual labels: speech-processing

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (-8.77%)

Mutual labels: speech-synthesis

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (-16.69%)

Mutual labels: speech-synthesis

Drachtio Freeswitch Modules

A collection of open-sourced freeswitch modules that I use in various drachtio applications

Stars: ✭ 73 (-95.59%)

Mutual labels: tts

Zhrtvc

Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统，包含语音编码器、语音合成器、声码器和可视化模块。

Stars: ✭ 771 (-53.39%)

Mutual labels: tts

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-94.68%)

Mutual labels: speech-processing

301-360 of 369 similar projects

first

‹

›