pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+9885.71%)

Mutual labels: speech, speech-recognition, asr

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+752.38%)

Mutual labels: speech, speech-recognition, asr

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+1066.67%)

Mutual labels: speech, tts

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+1052.38%)

Mutual labels: speech, speech-recognition

Pansori

Tools for ASR Corpus Generation from Online Video

Stars: ✭ 106 (+404.76%)

Mutual labels: corpus, speech-recognition

picovoice

The end-to-end platform for building voice products at scale

Stars: ✭ 316 (+1404.76%)

Mutual labels: voice, speech-recognition

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+1066.67%)

Mutual labels: speech, asr

brasiltts

Brasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…

Stars: ✭ 34 (+61.9%)

Mutual labels: voice, tts

api

Speechly public API definitions and generated code

Stars: ✭ 15 (-28.57%)

Mutual labels: voice, speech-recognition

Phomeme

Simple sentence mixing tool (work in progress)

Stars: ✭ 18 (-14.29%)

Mutual labels: voice, speech

VoiceDictation

迅飞语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息，让机器能够“听懂”人类语言，相当于给机器安装上“耳朵”，使其具备“能听”的功能。

Stars: ✭ 36 (+71.43%)

Mutual labels: voice, speech-recognition

react-client

An React client library for Speechly API

Stars: ✭ 71 (+238.1%)

Mutual labels: voice, speech-recognition

JSpeak

A Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features

Stars: ✭ 16 (-23.81%)

Mutual labels: voice, tts

voice-based-email-for-blind

Emailing System for visually impaired persons

Stars: ✭ 35 (+66.67%)

Mutual labels: voice, speech

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (+76.19%)

Mutual labels: voice, speech

vasisualy

Vasisualy it's a simple Russian voice assistant written on Python for GNU/Linux, Windows and Android.

Stars: ✭ 33 (+57.14%)

Mutual labels: voice, tts

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (+104.76%)

Mutual labels: voice, tts

download audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Stars: ✭ 53 (+152.38%)

Mutual labels: voice, speech-recognition

Neural Voice Cloning With Few Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Stars: ✭ 262 (+1147.62%)

Mutual labels: voice, tts

Alan Sdk Ionic

Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.