This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.

Stars: ✭ 74 (-93.93%)

Mutual labels: speech-recognition

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (-33.72%)

Mutual labels: speech-recognition

Deepspeech Examples

Examples of how to use or integrate DeepSpeech

Stars: ✭ 356 (-70.8%)

Mutual labels: speech-recognition

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-98.44%)

Mutual labels: speech

cape

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (-97.62%)

Mutual labels: speech

torch-asg

Auto Segmentation Criterion (ASG) implemented in pytorch

Stars: ✭ 42 (-96.55%)

Mutual labels: speech

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (-60.54%)

Mutual labels: speech-recognition

multilingual kws

Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus

Stars: ✭ 122 (-89.99%)

Mutual labels: speech-recognition

sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.

Stars: ✭ 45 (-96.31%)

Mutual labels: speech-recognition

formulas-python

Ritchie CLI formulas in Python 🐍

Stars: ✭ 17 (-98.61%)

Mutual labels: speech-recognition

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (-70.96%)

Mutual labels: speech-recognition

soxan

Wav2Vec for speech recognition, classification, and audio classification

Stars: ✭ 113 (-90.73%)

Mutual labels: speech-recognition

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-94.01%)

Mutual labels: speech

Xr3player

🎧 🎼 Advanced JavaFX Media Player

Stars: ✭ 472 (-61.28%)

Mutual labels: speech

Parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.

Stars: ✭ 48 (-96.06%)

Mutual labels: speech-recognition

Tensorflow-Keyword-Spotting

Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.

Stars: ✭ 27 (-97.79%)

Mutual labels: speech-recognition

Inaspeechsegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (-71.12%)

Mutual labels: speech

scription

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

Stars: ✭ 46 (-96.23%)

Mutual labels: speech-to-text

TextNormalizationCoveringGrammars

Covering grammars for English and Russian text normalization

Stars: ✭ 60 (-95.08%)

Mutual labels: speech-recognition

scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

Stars: ✭ 17 (-98.61%)

Mutual labels: speech-recognition

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (-75.8%)

Mutual labels: speech

gtranscribe

Software for interview transcription

Stars: ✭ 12 (-99.02%)

Mutual labels: speech

Brevitas

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (-71.86%)

Mutual labels: speech-recognition

Speech Feature Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

Stars: ✭ 78 (-93.6%)

Mutual labels: speech

kosr

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Stars: ✭ 25 (-97.95%)

Mutual labels: speech-recognition

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-97.46%)

Mutual labels: speech

Naver-AI-Hackathon-Speech

2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib

Stars: ✭ 26 (-97.87%)

Mutual labels: speech

porfir

Голосовой ассистент Порфирьевич

Stars: ✭ 23 (-98.11%)

Mutual labels: speech-recognition

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (-98.85%)

Mutual labels: speech

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (-37.33%)

Mutual labels: speech-recognition

Autoedit 2

Fast text based video editing, node Electron Os X desktop app, with Backbone front end.

Stars: ✭ 343 (-71.86%)

Mutual labels: speech-to-text

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (-96.72%)

Mutual labels: speech

LIUM

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-97.7%)

Mutual labels: speech

DeepSegmentor

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Stars: ✭ 17 (-98.61%)

Mutual labels: speech

speech recognition ctc

Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别

Stars: ✭ 40 (-96.72%)

Mutual labels: speech

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+345.2%)

Mutual labels: speech

digital-paper-edit-client

Work in progress - BBC News Labs digital paper edit project - React Client

Stars: ✭ 36 (-97.05%)

Mutual labels: speech-to-text

Udacity Natural Language Processing Nanodegree

Tutorials and my solutions to the Udacity NLP Nanodegree

Stars: ✭ 73 (-94.01%)

Mutual labels: speech-to-text

Speech ai

Simple speech linguistic AI with Python

Stars: ✭ 66 (-94.59%)

Mutual labels: speech-recognition

Textnormalizationcoveringgrammars

Covering grammars for English and Russian text normalization

Stars: ✭ 46 (-96.23%)

Mutual labels: speech-recognition

converse

Conversational text Analysis using various NLP techniques

Stars: ✭ 147 (-87.94%)

Mutual labels: speech-to-text

Ios 10 Sampler

Code examples for new APIs of iOS 10.

Stars: ✭ 3,341 (+174.08%)

Mutual labels: speech

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (-96.96%)

Mutual labels: speech

A chronology of deep learning

Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.

Stars: ✭ 47 (-96.14%)

Mutual labels: speech-recognition

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+1037.82%)

Mutual labels: speech

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-98.36%)

Mutual labels: speech

Nonocaptcha

An asynchronized Python library to automate solving ReCAPTCHA v2 using audio