PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Stars: ✭ 2,934 (+140.69%)

Mutual labels: speech-recognition

Formant Analyzer

iOS application for finding formants in spoken sounds

Stars: ✭ 43 (-96.47%)

Mutual labels: speech-recognition

Voice Synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

Stars: ✭ 51 (-95.82%)

Mutual labels: speech-to-text

Wavenet Stt

An end-to-end speech recognition system with Wavenet. Built using C++ and python.

Stars: ✭ 18 (-98.52%)

Mutual labels: speech-recognition

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (-69.32%)

Mutual labels: speech-recognition

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (-95.9%)

Mutual labels: speech

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-91.47%)

Mutual labels: speech-recognition

Speech Aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (-78.75%)

Mutual labels: speech

Phomeme

Simple sentence mixing tool (work in progress)

Stars: ✭ 18 (-98.52%)

Mutual labels: speech

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (-48.07%)

Mutual labels: speech-recognition

Subsync

Subtitle Speech Synchronizer

Stars: ✭ 379 (-68.91%)

Mutual labels: speech-recognition

pocketsphinx

Updated ROS bindings to pocketsphinx

Stars: ✭ 36 (-97.05%)

Mutual labels: speech-recognition

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-95.98%)

Mutual labels: speech

nlp-class

A Natural Language Processing course taught by Professor Ghassemi

Stars: ✭ 95 (-92.21%)

Mutual labels: speech

Speechpy

💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

Stars: ✭ 833 (-31.67%)

Mutual labels: speech-recognition

Alan Sdk Web

Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.

Stars: ✭ 368 (-69.81%)

Mutual labels: speech-recognition

TASNET

Time-domain Audio Separation Network (IN PYTORCH)

Stars: ✭ 18 (-98.52%)

Mutual labels: speech

Speech-Command-Recognition-with-Capsule-Network

Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.

Stars: ✭ 20 (-98.36%)

Mutual labels: speech-recognition

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (-74.98%)

Mutual labels: speech

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (-94.5%)

Mutual labels: speech

Android-TTS-STT

One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem

Stars: ✭ 77 (-93.68%)

Mutual labels: speech-recognition

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-98.85%)

Mutual labels: speech-recognition

Open stt

Open STT

Stars: ✭ 584 (-52.09%)

Mutual labels: speech-to-text

Nlp Paper

自然语言处理领域下的对话语音领域，整理相关论文（附阅读笔记），复现模型以及数据处理等（代码含TensorFlow和PyTorch两版本）

Stars: ✭ 67 (-94.5%)

Mutual labels: speech

Audio Signal Processing

Audio or speech signal processing guide.

Stars: ✭ 45 (-96.31%)

Mutual labels: speech

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+271.86%)

Mutual labels: speech-recognition

Speech-Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 21 (-98.28%)

Mutual labels: speech-recognition

Android Speech Recognition

Continuous speech recognition library for Android with options to use GoogleVoiceIme dialog and offline mode.

Stars: ✭ 72 (-94.09%)

Mutual labels: speech-recognition

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (-55.54%)

Mutual labels: speech-recognition

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (-70.3%)

Mutual labels: speech

NLP Toolkit

Library of state-of-the-art models (PyTorch) for NLP tasks

Stars: ✭ 92 (-92.45%)

Mutual labels: speech-recognition

audio noise clustering

https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-98.03%)

Mutual labels: speech

Recording-Bot

A bot built to record and transcribe audio fragments from Discord.

Stars: ✭ 22 (-98.2%)

Mutual labels: speech-recognition

benchmarkstt

Open Source AI Benchmarking toolkit for benchmarking speech to text services

Stars: ✭ 43 (-96.47%)

Mutual labels: speech-to-text

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-96.72%)

Mutual labels: speech

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (-33.72%)

Mutual labels: speech-recognition

Deepspeech Examples

Examples of how to use or integrate DeepSpeech

Stars: ✭ 356 (-70.8%)

Mutual labels: speech-recognition

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-98.44%)

Mutual labels: speech

voicekit-examples

Examples on how to use Tinkoff Voicekit

Stars: ✭ 35 (-97.13%)

Mutual labels: speech-recognition

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

Stars: ✭ 150 (-87.69%)

Mutual labels: speech-recognition

formulas-python

Ritchie CLI formulas in Python 🐍

Stars: ✭ 17 (-98.61%)

Mutual labels: speech-recognition

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (-70.96%)

Mutual labels: speech-recognition

soxan

Wav2Vec for speech recognition, classification, and audio classification

Stars: ✭ 113 (-90.73%)

Mutual labels: speech-recognition

Parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.

Stars: ✭ 48 (-96.06%)

Mutual labels: speech-recognition

Tensorflow-Keyword-Spotting

Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.

Stars: ✭ 27 (-97.79%)

Mutual labels: speech-recognition

Inaspeechsegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (-71.12%)

Mutual labels: speech

scription

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

Stars: ✭ 46 (-96.23%)

Mutual labels: speech-to-text

gtranscribe

Software for interview transcription

Stars: ✭ 12 (-99.02%)

Mutual labels: speech

241-300 of 495 similar projects

first

‹

›