Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+917.07%)

Mutual labels: speech-synthesis, speech-recognition

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (-42.44%)

Mutual labels: speech-recognition, speech-processing

Speech ai

Simple speech linguistic AI with Python

Stars: ✭ 66 (-67.8%)

Mutual labels: speech-synthesis, speech-recognition

Artyom.js

A JavaScript library for voice control, voice commands, speech recognition, and speech synthesis. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+393.17%)

Mutual labels: speech-synthesis, speech-recognition

ml-with-audio

HF's ML for Audio study group

Stars: ✭ 104 (-49.27%)

Mutual labels: speech-synthesis, speech-recognition

Deepvoice3 pytorch

PyTorch implementation of convolutional neural network-based text-to-speech synthesis models

Stars: ✭ 1,654 (+706.83%)

Mutual labels: speech-synthesis, speech-processing

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+572.2%)

Mutual labels: speech-synthesis, speech-recognition

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-35.12%)

Mutual labels: speech-synthesis, speech-recognition

Vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Stars: ✭ 158 (-22.93%)

Mutual labels: speech-synthesis, speech-processing

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-75.61%)

Mutual labels: speech-synthesis, speech-recognition

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (+2.93%)

Mutual labels: speech-synthesis, speech-processing

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Stars: ✭ 94 (-54.15%)

Mutual labels: speech-recognition, speech-processing

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

Stars: ✭ 150 (-26.83%)

Mutual labels: speech-recognition, speech-processing

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-59.02%)

Mutual labels: speech-synthesis, speech-recognition

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+272.68%)

Mutual labels: speech-recognition, speech-processing

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (+228.29%)

Mutual labels: speech-recognition, speech-processing

Formant Analyzer

iOS application for finding formants in spoken sounds

Stars: ✭ 43 (-79.02%)

Mutual labels: speech-recognition, speech-processing

Libfaceid

libfaceid is a research framework for prototyping face recognition solutions. It seamlessly integrates multiple detection, recognition, and liveness models with speech synthesis and speech recognition.

Stars: ✭ 354 (+72.68%)

Mutual labels: speech-synthesis, speech-recognition

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+18.05%)

Mutual labels: speech-recognition, speech-processing

porfir

Porfirevich voice assistant

Stars: ✭ 23 (-88.78%)

Mutual labels: speech-synthesis, speech-recognition

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-28.78%)

Mutual labels: speech-recognition, speech-processing

Khronos

The open source intelligent personal assistant

Stars: ✭ 25 (-87.8%)

Mutual labels: speech-synthesis, speech-recognition

voicekit-examples

Examples on how to use Tinkoff Voicekit

Stars: ✭ 35 (-82.93%)

Mutual labels: speech-synthesis, speech-recognition

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-82.93%)

Mutual labels: speech-synthesis, speech-recognition

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+839.51%)

Mutual labels: speech-synthesis, speech-processing

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+164.39%)

Mutual labels: speech-synthesis, speech-recognition

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+139.02%)

Mutual labels: speech-synthesis, speech-recognition

Cross vc

Cross-lingual Voice Conversion

Stars: ✭ 91 (-55.61%)

Mutual labels: speech-synthesis, speech-recognition

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+2111.22%)

Mutual labels: speech-synthesis, speech-recognition

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+636.1%)

Mutual labels: speech-synthesis, speech-recognition

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-49.76%)

Mutual labels: speech-synthesis, speech-recognition

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (+50.24%)

Mutual labels: speech-synthesis, speech-processing

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-16.59%)

Mutual labels: speech-synthesis, speech-recognition

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+43.9%)

Mutual labels: speech-synthesis, speech-processing

torchsubband

PyTorch implementation of subband decomposition

Stars: ✭ 63 (-69.27%)

Mutual labels: speech-recognition, speech-processing

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Stars: ✭ 60 (-70.73%)

Mutual labels: speech-synthesis

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-49.27%)

Mutual labels: speech-recognition

salutejs

SmartApp Framework for building skills for the "Salute" family of virtual assistants in JavaScript

Stars: ✭ 35 (-82.93%)

Mutual labels: speech-recognition

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (-28.29%)

Mutual labels: speech-recognition

QPPWG

Quasi-Periodic Parallel WaveGAN PyTorch implementation

Stars: ✭ 41 (-80%)

Mutual labels: speech-synthesis

MediumVC

Any-to-any voice conversion using synthetic speaker-specific speech as intermediate features

Stars: ✭ 46 (-77.56%)

Mutual labels: speech-synthesis

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (-47.8%)

Mutual labels: speech-synthesis

Android-TTS-STT

One-line solution for Android text-to-speech (TTS) and speech-to-text (STT) conversion

Stars: ✭ 77 (-62.44%)

Mutual labels: speech-recognition

1-60 of 480 similar projects
