All Projects → Setk → Similar Projects or Alternatives

227 Open source projects that are alternatives of or similar to Setk

Go bindings for Flite (festival-lite)

Stars: ✭ 14 (-93.83%)

Mutual labels: speech

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (+454.19%)

Mutual labels: speech

dropclass speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

Stars: ✭ 20 (-91.19%)

Mutual labels: kaldi

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+748.46%)

Mutual labels: speech

An FPGA implementation of a classic 80ies speech synthesizer. Done for the Retro Challenge 2017/10.

Stars: ✭ 51 (-77.53%)

Mutual labels: speech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+437%)

Mutual labels: speech

Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'

Stars: ✭ 40 (-82.38%)

Mutual labels: speech

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (-19.82%)

Mutual labels: speech

Auto Segmentation Criterion (ASG) implemented in pytorch

Stars: ✭ 42 (-81.5%)

Mutual labels: speech

Ivector Xvector

Extract xvector and ivector under kaldi

Stars: ✭ 67 (-70.48%)

Mutual labels: kaldi

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-67.84%)

Mutual labels: speech

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-39.21%)

Mutual labels: speech

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-77.09%)

Mutual labels: speech

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-71.81%)

Mutual labels: speech

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-87.67%)

Mutual labels: speech

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

Stars: ✭ 202 (-11.01%)

Mutual labels: speech

p2p speech encrypting device with analog audio interface suitable for GSM phones

Stars: ✭ 26 (-88.55%)

Mutual labels: speech

AI智能审查，支持色情识别、暴恐识别、语言识别、敏感文字检测和视频检测等功能，以及各种OCR识别能力，如身份证、驾照、行驶证、营业执照、银行卡、手写体、车牌和名片识别等功能，可以访问网站体验功能。

Stars: ✭ 60 (-73.57%)

Mutual labels: kaldi

A Simulation Framework for Auditory Discrimination Experiments

Stars: ✭ 12 (-94.71%)

Mutual labels: speech

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-40.53%)

Mutual labels: speech

kaldi-timit-sre-ivector

Develop speaker recognition model based on i-vector using TIMIT database

Stars: ✭ 17 (-92.51%)

Mutual labels: kaldi

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-74.89%)

Mutual labels: speech

a simple php implementation of a Nabaztag server

Stars: ✭ 14 (-93.83%)

Mutual labels: speech

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-22.91%)

Mutual labels: speech

KAREN: Unifying Hatespeech Detection and Benchmarking

Stars: ✭ 18 (-92.07%)

Mutual labels: speech

The ITU-T Software Tool Library (G.191)

Stars: ✭ 44 (-80.62%)

Mutual labels: speech

End-to-End Neural Diarization

Stars: ✭ 153 (-32.6%)

Mutual labels: kaldi

Wavenet Enhancement

Speech Enhancement using Bayesian WaveNet

Stars: ✭ 86 (-62.11%)

Mutual labels: speech

Speech Vad Demo

集成Webrtc的VAD，用于切分音频文件

Stars: ✭ 259 (+14.1%)

Mutual labels: speech

an open source voice command macro software

Stars: ✭ 130 (-42.73%)

Mutual labels: speech

A Natural Language Processing course taught by Professor Ghassemi

Stars: ✭ 95 (-58.15%)

Mutual labels: speech

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-82.38%)

Mutual labels: speech

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (-70.48%)

Mutual labels: speech

Speech Enhancement

Deep learning for audio denoising

Stars: ✭ 207 (-8.81%)

Mutual labels: speech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-1.32%)

Mutual labels: speech

Voxceleb Ivector

Voxceleb1 i-vector based speaker recognition system

Stars: ✭ 36 (-84.14%)

Mutual labels: kaldi

Software for interview transcription

Stars: ✭ 12 (-94.71%)

Mutual labels: speech

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-43.61%)

Mutual labels: speech

KaldiBasedSpeakerVerification

Kaldi based speaker verification

Stars: ✭ 43 (-81.06%)

Mutual labels: kaldi

Theano Kaldi Rnn

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Stars: ✭ 31 (-86.34%)

Mutual labels: kaldi

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (-82.38%)

Mutual labels: speech

Deep speaker Speaker recognition system

Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)

Stars: ✭ 174 (-23.35%)

Mutual labels: speech

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (-83.7%)

Mutual labels: speech

c++ Kaldi IO lib (static and dynamic).

Stars: ✭ 22 (-90.31%)

Mutual labels: kaldi

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-91.19%)

Mutual labels: speech

Amazing Python Scripts

🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

Stars: ✭ 229 (+0.88%)

Mutual labels: speech

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (-13.66%)

Mutual labels: kaldi

An LDA/PLDA estimator using KALDI in python for speaker verification tasks

Stars: ✭ 85 (-62.56%)

Mutual labels: kaldi

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-78.41%)

Mutual labels: speech

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

Stars: ✭ 20 (-91.19%)

Mutual labels: kaldi

💬 Speech recognition for your site

Stars: ✭ 6,216 (+2638.33%)

Mutual labels: speech

data-at-hand-mobile

Mobile application for exploring fitness data using both speech and touch interaction.

Stars: ✭ 50 (-77.97%)

Mutual labels: speech

Code Switching Papers

A curated list of research papers and resources on code-switching

Stars: ✭ 122 (-46.26%)

Mutual labels: speech

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-61.23%)

Mutual labels: speech

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+191.19%)

Mutual labels: speech

Source separation

Deep learning based speech source separation using Pytorch

Stars: ✭ 226 (-0.44%)

Mutual labels: speech

Speech Denoiser

A speech denoise lv2 plugin based on RNNoise library

Stars: ✭ 220 (-3.08%)

Mutual labels: speech

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-9.69%)

Mutual labels: speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Stars: ✭ 187 (-17.62%)

Mutual labels: speech

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (-33.48%)

Mutual labels: kaldi

121-180 of 227 similar projects