All Projects → Setk → Similar Projects or Alternatives

227 Open source projects that are alternatives of or similar to Setk

Lhotse

Stars: ✭ 236 (+3.96%)

Mutual labels: speech, kaldi

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (-45.37%)

Mutual labels: speech, kaldi

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+73.13%)

Mutual labels: speech, kaldi

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (-77.97%)

Mutual labels: speech, kaldi

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+823.79%)

Mutual labels: speech, kaldi

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+4812.33%)

Mutual labels: speech, kaldi

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-94.27%)

Mutual labels: speech, kaldi

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-93.83%)

Mutual labels: speech, kaldi

Speech Aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (+14.1%)

Mutual labels: speech, kaldi

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+233.04%)

Mutual labels: speech, kaldi

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-38.77%)

Mutual labels: speech

Tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Stars: ✭ 1,756 (+673.57%)

Mutual labels: speech

Siricontrol System

Control anything with Siri voice commands.

Stars: ✭ 180 (-20.7%)

Mutual labels: speech

Esp8266sam

Speech synthesis for ESP8266 using S.A.M. port

Stars: ✭ 199 (-12.33%)

Mutual labels: speech

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (-41.85%)

Mutual labels: speech

Kaldi Onnx

Kaldi model converter to ONNX

Stars: ✭ 174 (-23.35%)

Mutual labels: kaldi

Voc

A physical model of the human vocal tract using literate programming, based on Pink Trombone.

Stars: ✭ 129 (-43.17%)

Mutual labels: speech

Reconstructing faces from voices

An example of the paper "reconstructing faces from voices"

Stars: ✭ 127 (-44.05%)

Mutual labels: speech

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (-13.66%)

Mutual labels: kaldi

Chatbot Watson Android

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Stars: ✭ 169 (-25.55%)

Mutual labels: speech

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (-48.02%)

Mutual labels: speech

Tts Cube

End-2-end speech synthesis with recurrent neural networks

Stars: ✭ 213 (-6.17%)

Mutual labels: speech

Lingvo

Stars: ✭ 2,361 (+940.09%)

Mutual labels: speech

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-27.31%)

Mutual labels: speech

Tf Kaldi Speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Stars: ✭ 117 (-48.46%)

Mutual labels: kaldi

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-49.78%)

Mutual labels: speech

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+748.46%)

Mutual labels: speech

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (-19.82%)

Mutual labels: speech

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-39.21%)

Mutual labels: speech

Timit

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

Stars: ✭ 202 (-11.01%)

Mutual labels: speech

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-40.53%)

Mutual labels: speech

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-22.91%)

Mutual labels: speech

Avpi

an open source voice command macro software

Stars: ✭ 130 (-42.73%)

Mutual labels: speech

Speech Enhancement

Deep learning for audio denoising

Stars: ✭ 207 (-8.81%)

Mutual labels: speech

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-43.61%)

Mutual labels: speech

Deep speaker Speaker recognition system

Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)

Stars: ✭ 174 (-23.35%)

Mutual labels: speech

Kaldiio

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (-29.52%)

Mutual labels: kaldi

Python Speech recognition

A simple example for use speech recognition baidu api with python.

Stars: ✭ 106 (-53.3%)

Mutual labels: speech

Code Switching Papers

A curated list of research papers and resources on code-switching

Stars: ✭ 122 (-46.26%)

Mutual labels: speech

Kaldi Gop

Computes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.

Stars: ✭ 104 (-54.19%)

Mutual labels: kaldi

Speech And Text Unity Ios Android

Speed to text in Unity iOS use Native Speech Recognition

Stars: ✭ 117 (-48.46%)

Mutual labels: speech

Volute

Raspberry Pi + Nodejs = Speech Robot

Stars: ✭ 224 (-1.32%)

Mutual labels: speech

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-49.34%)

Mutual labels: speech

Ctc pytorch

CTC end -to-end ASR for timit and 863 corpus.

Stars: ✭ 161 (-29.07%)

Mutual labels: kaldi

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-51.1%)

Mutual labels: speech

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (-15.86%)

Mutual labels: speech

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+551.54%)

Mutual labels: speech

Tts Papers

🐸 collection of TTS papers

Stars: ✭ 160 (-29.52%)

Mutual labels: speech

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (-7.05%)

Mutual labels: speech

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+497.8%)

Mutual labels: kaldi

Emotion Classification From Audio Files

Understanding emotions from audio files using neural networks and multiple datasets.

Stars: ✭ 189 (-16.74%)

Mutual labels: speech

Pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

Stars: ✭ 158 (-30.4%)

Mutual labels: kaldi

Elpis

🙊 WIP software for creating speech recognition models.

Stars: ✭ 101 (-55.51%)

Mutual labels: kaldi

Pytorch Kaldi Neural Speaker Embeddings

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

Stars: ✭ 99 (-56.39%)

Mutual labels: kaldi

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (-31.28%)

Mutual labels: kaldi

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-56.39%)

Mutual labels: speech

Wikipron

Massively multilingual pronunciation mining

Stars: ✭ 99 (-56.39%)

Mutual labels: speech

Depression Detect

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Stars: ✭ 187 (-17.62%)

Mutual labels: speech

Aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+755.51%)

Mutual labels: speech

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (-56.83%)

Mutual labels: kaldi

1-60 of 227 similar projects

›