Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+4243.75%)

Mutual labels: tts, speech-recognition

Dla

Deep learning for audio processing

Stars: ✭ 142 (+195.83%)

Mutual labels: speech-recognition, tts

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+1029.17%)

Mutual labels: speech-recognition, tts

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+1652.08%)

Mutual labels: tts, speech-recognition

Py Nltools

A collection of basic Python modules for spoken natural language processing

Stars: ✭ 46 (-4.17%)

Mutual labels: speech-recognition, tts

Arvutaja

An Android app for voice actions in Estonian and English

Stars: ✭ 28 (-41.67%)

Mutual labels: speech-recognition

Ekho

Chinese text-to-speech engine

Stars: ✭ 690 (+1337.5%)

Mutual labels: tts

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch

Stars: ✭ 682 (+1320.83%)

Mutual labels: tts

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (+1302.08%)

Mutual labels: speech-recognition

Pncc

An implementation of Power Normalized Cepstral Coefficients (PNCC)

Stars: ✭ 40 (-16.67%)

Mutual labels: speech-recognition

Rhasspy

Rhasspy voice assistant for offline home automation

Stars: ✭ 851 (+1672.92%)

Mutual labels: speech-recognition

Speech Emotion Analyzer

The neural network model is capable of detecting five different emotions in male and female speech from audio recordings. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+1218.75%)

Mutual labels: speech-recognition

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+1195.83%)

Mutual labels: speech-recognition

Assistant Client

A tool for testing and debugging CanvasApps with the "Salute" ("Салют") family of virtual assistants

Stars: ✭ 26 (-45.83%)

Mutual labels: speech-recognition

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (+1185.42%)

Mutual labels: speech-recognition

Real Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Stars: ✭ 32,095 (+66764.58%)

Mutual labels: tts

Formant Analyzer

iOS application for finding formants in spoken sounds

Stars: ✭ 43 (-10.42%)

Mutual labels: speech-recognition

Voice

🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)

Stars: ✭ 993 (+1968.75%)

Mutual labels: speech-recognition

Speechpy

💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

Stars: ✭ 833 (+1635.42%)

Mutual labels: speech-recognition

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+1008.33%)

Mutual labels: speech-recognition

Mtrans

Multi-source Translation

Stars: ✭ 711 (+1381.25%)

Mutual labels: tts

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-35.42%)

Mutual labels: tts

Adapt

Adapt Intent Parser

Stars: ✭ 690 (+1337.5%)

Mutual labels: speech-recognition

Artyom.js

A voice control, voice command, speech recognition, and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+2006.25%)

Mutual labels: speech-recognition

Wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 5,907 (+12206.25%)

Mutual labels: speech-recognition

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-41.67%)

Mutual labels: tts

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+12397.92%)

Mutual labels: speech-recognition

Cortex M Kws

Cortex M KWS example with Tengine Lite.

Stars: ✭ 45 (-6.25%)

Mutual labels: speech-recognition

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (+1218.75%)

Mutual labels: speech-recognition

Kaldi Gstreamer Server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Stars: ✭ 935 (+1847.92%)

Mutual labels: speech-recognition

Transformertts

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Stars: ✭ 617 (+1185.42%)

Mutual labels: tts

Tacotron Wavernn

TTS (Tacotron + WaveRNN)

Stars: ✭ 40 (-16.67%)

Mutual labels: tts

Wavenet Stt

An end-to-end speech recognition system with Wavenet. Built using C++ and Python.

Stars: ✭ 18 (-62.5%)

Mutual labels: speech-recognition

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+987.5%)

Mutual labels: speech-recognition

Speech recognition

Chinese speech recognition

Stars: ✭ 534 (+1012.5%)

Mutual labels: speech-recognition

Keras Sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-2.08%)

Mutual labels: speech-recognition

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (+1002.08%)

Mutual labels: speech-recognition

Kur

Descriptive Deep Learning

Stars: ✭ 811 (+1589.58%)

Mutual labels: speech-recognition

Discordspeechbot

A speech-to-text bot for Discord with music commands and more, using NodeJS. Ideal for controlling your Discord server using voice commands; can also be useful for hearing-impaired people.

Stars: ✭ 35 (-27.08%)

Mutual labels: speech-recognition

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+927.08%)

Mutual labels: tts

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+1583.33%)

Mutual labels: speech-recognition

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+920.83%)

Mutual labels: speech-recognition

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (+902.08%)

Mutual labels: speech-recognition

Zhrtvc

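Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system, including a speech encoder, speech synthesizer, vocoder, and visualization module.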

Stars: ✭ 771 (+1506.25%)

Mutual labels: tts

Speech To Text Benchmark

speech to text benchmark framework

Stars: ✭ 481 (+902.08%)

Mutual labels: speech-recognition

Rhasspy

Offline private voice assistant for many human languages

Stars: ✭ 458 (+854.17%)

Mutual labels: speech-recognition

Avsr Deep Speech

Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab

Stars: ✭ 43 (-10.42%)

Mutual labels: speech-recognition

Wsay

Windows "say"

Stars: ✭ 36 (-25%)

Mutual labels: tts

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks, imitating much of a virtual assistant's work.

Stars: ✭ 772 (+1508.33%)

Mutual labels: speech-recognition

Speech Demo

Speech API examples

Stars: ✭ 454 (+845.83%)

Mutual labels: speech-recognition

Uspeech

Speech recognition toolkit for the Arduino

Stars: ✭ 448 (+833.33%)

Mutual labels: speech-recognition

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.