https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-40%)

Mutual labels: speech

nabaztag-php

A simple PHP implementation of a Nabaztag server

Stars: ✭ 14 (-65%)

Mutual labels: speech

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-47.5%)

Mutual labels: speech

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (-65%)

Mutual labels: speech

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-47.5%)

Mutual labels: speech

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (-7.5%)

Mutual labels: speech

LIGHT-SERNET

Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition

Stars: ✭ 20 (-50%)

Mutual labels: speech-emotion-recognition

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-50%)

Mutual labels: speech

speech recognition ctc

Chinese speech recognition with CTC, implemented in Keras

Stars: ✭ 40 (+0%)

Mutual labels: speech

deepspeech.mxnet

An MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+105%)

Mutual labels: speech

speech to text

How to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-12.5%)

Mutual labels: speech

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-67.5%)

Mutual labels: speech

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (+82.5%)

Mutual labels: speech

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+2002.5%)

Mutual labels: speech-emotion-recognition

TASNET

Time-domain Audio Separation Network (in PyTorch)

Stars: ✭ 18 (-55%)

Mutual labels: speech

voice-based-email-for-blind

Emailing System for visually impaired persons

Stars: ✭ 35 (-12.5%)

Mutual labels: speech

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+295%)

Mutual labels: speech

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Stars: ✭ 45 (+12.5%)

Mutual labels: speech

Audio Signal Processing

Audio or speech signal processing guide.

Stars: ✭ 45 (+12.5%)

Mutual labels: speech

aframe-speech-controls-component

An alternative form of input for in-VR interaction with the content of a scene

Stars: ✭ 13 (-67.5%)

Mutual labels: speech

torch-asg

Auto Segmentation Criterion (ASG) implemented in PyTorch

Stars: ✭ 42 (+5%)

Mutual labels: speech

Phomeme

Simple sentence mixing tool (work in progress)

Stars: ✭ 18 (-55%)

Mutual labels: speech

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-52.5%)

Mutual labels: speech

attentive-modality-hopping-for-SER

TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20

Stars: ✭ 25 (-37.5%)

Mutual labels: speech-emotion-recognition

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-52.5%)

Mutual labels: speech

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+302.5%)

Mutual labels: speech

SpeechEmoRec

Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching

Stars: ✭ 44 (+10%)

Mutual labels: speech-emotion-recognition

Shifter

Pitch shifter using WSOLA and resampling, implemented in Python 3

Stars: ✭ 22 (-45%)

Mutual labels: speech

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (+62.5%)

Mutual labels: speech

Speech Feature Extraction

Feature extraction from the speech signal is the initial stage of any speech recognition system.

Stars: ✭ 78 (+95%)

Mutual labels: speech

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Stars: ✭ 143 (+257.5%)

Mutual labels: speech

HTK

The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.

Stars: ✭ 23 (-42.5%)

Mutual labels: speech

lidbox

End-to-end spoken language identification out of the box.

Stars: ✭ 39 (-2.5%)

Mutual labels: speech

speech-transformer

Transformer implementation specialized in speech recognition tasks using PyTorch.

Stars: ✭ 40 (+0%)

Mutual labels: speech

jarvis

Jarvis Home Automation

Stars: ✭ 81 (+102.5%)

Mutual labels: speech

wikipron

Massively multilingual pronunciation mining

Stars: ✭ 167 (+317.5%)

Mutual labels: speech

LIUM

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-30%)

Mutual labels: speech

KAREN

KAREN: Unifying Hatespeech Detection and Benchmarking

Stars: ✭ 18 (-55%)

Mutual labels: speech

1-60 of 192 similar projects
