libfaceid is a research framework for prototyping face recognition solutions. It integrates multiple detection, recognition, and liveness models with speech synthesis and speech recognition.

Stars: ✭ 354 (-92.19%)

Mutual labels: speech-recognition, speech-synthesis

MediumVC

Any-to-any voice conversion using synthetic speaker-specific speech as intermediate features

Stars: ✭ 46 (-98.99%)

Mutual labels: speech-synthesis, voice-conversion

speechrec

A simple speech recognition app using the Web Speech API interfaces

Stars: ✭ 18 (-99.6%)

Mutual labels: speech-synthesis, speech-recognition

ml-with-audio

HF's ML for Audio study group

Stars: ✭ 104 (-97.71%)

Mutual labels: speech-synthesis, speech-recognition

SpeechTransProgress

Tracking the progress in end-to-end speech translation

Stars: ✭ 139 (-96.93%)

Mutual labels: machine-translation, speech-translation

Kaldi Onnx

Kaldi model converter to ONNX

Stars: ✭ 174 (-96.16%)

Mutual labels: speech-recognition, kaldi

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-96.14%)

Mutual labels: speech-recognition, end-to-end

Zeroth

Kaldi-based Korean ASR (Korean speech recognition) open-source project

Stars: ✭ 248 (-94.53%)

Mutual labels: speech-recognition, kaldi

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-98.15%)

Mutual labels: speech-synthesis, speech-recognition

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-99.23%)

Mutual labels: speech-synthesis, speech-recognition

SingleVC

Any-to-one voice conversion using a data augmentation strategy: pitch is shifted while duration is preserved.

Stars: ✭ 25 (-99.45%)

Mutual labels: speech-synthesis, voice-conversion

Tensorflow end2end speech recognition

End-to-end speech recognition implementation based on TensorFlow (CTC, attention, and MTL training)

Stars: ✭ 305 (-93.27%)

Mutual labels: speech-recognition, end-to-end

torchsubband

PyTorch implementation of subband decomposition

Stars: ✭ 63 (-98.61%)

Mutual labels: speech-recognition, speech-enhancement

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-98.79%)

Mutual labels: end-to-end, speech-synthesis

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-99.54%)

Mutual labels: speech-recognition, kaldi

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-99.4%)

Mutual labels: speech-synthesis, speech-recognition

voice-conversion

A tutorial implementation of voice conversion using PyTorch

Stars: ✭ 26 (-99.43%)

Mutual labels: speech-synthesis, voice-conversion

TinyCog

Small Robot, Toy Robot platform

Stars: ✭ 29 (-99.36%)

Mutual labels: speech-synthesis, speech-recognition

NLP Toolkit

Library of state-of-the-art models (PyTorch) for NLP tasks

Stars: ✭ 92 (-97.97%)

Mutual labels: machine-translation, speech-recognition

Speech-Recognition

End-to-end automatic speech recognition for Mandarin and English in TensorFlow

Stars: ✭ 21 (-99.54%)

Mutual labels: end-to-end, speech-recognition

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-96.23%)

Mutual labels: speech-recognition, speech-synthesis

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (-53.74%)

Mutual labels: speech-recognition, kaldi

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (-95.81%)

Mutual labels: speech-recognition, end-to-end

Kaldiio

A pure Python module for reading and writing Kaldi ark files

Stars: ✭ 160 (-96.47%)

Mutual labels: speech-recognition, kaldi

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (-95.68%)

Mutual labels: speech-recognition, kaldi

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (-96.56%)

Mutual labels: speech-recognition, kaldi

Vosk Android Demo

Offline speech recognition for Android with the Vosk library.

Stars: ✭ 271 (-94.02%)

Mutual labels: speech-recognition, kaldi

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (-89.94%)

Mutual labels: end-to-end, speech-recognition

Khronos

The open source intelligent personal assistant

Stars: ✭ 25 (-99.45%)

Mutual labels: speech-synthesis, speech-recognition

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-98.83%)

Mutual labels: speech-synthesis, speech-recognition

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-99.56%)

Mutual labels: end-to-end, speech-recognition

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-98.9%)

Mutual labels: speech-synthesis, speech-recognition

kaldi ag training

Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-99.69%)

Mutual labels: speech-recognition, kaldi

Speech To Text Russian

A project for Russian speech recognition based on pykaldi.

Stars: ✭ 151 (-96.67%)

Mutual labels: speech-recognition, kaldi

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (-95.48%)

Mutual labels: speech-synthesis, speech-recognition

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Stars: ✭ 217 (-95.21%)

Mutual labels: speech-synthesis, voice-conversion

srvk-eesen-offline-transcriber

Top-level code to transcribe English audio/video files into text/subtitles

Stars: ✭ 22 (-99.51%)

Mutual labels: speech-recognition, kaldi

Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Stars: ✭ 61 (-98.65%)

Mutual labels: end-to-end, speech-recognition

ppg-vc

PPG-Based Voice Conversion

Stars: ✭ 154 (-96.6%)

Mutual labels: speech-synthesis, voice-conversion

Voice-Separation-and-Enhancement

A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.

Stars: ✭ 60 (-98.68%)

Mutual labels: speech-separation, speech-enhancement

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-95.06%)

Mutual labels: speech-recognition, speech-separation

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-97.71%)

Mutual labels: speech-recognition, kaldi

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 6,026 (+32.94%)

Mutual labels: end-to-end, speech-recognition

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Russian speech recognition using TensorFlow, trained on the VoxForge dataset

Stars: ✭ 50 (-98.9%)

Mutual labels: end-to-end, speech-recognition

vosk-model-ru-adaptation

No description or website provided.