227 open-source projects that are alternatives to or similar to Setk

React Native Dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
Stars: ✭ 182 (-19.82%)
Mutual labels:  speech
Ivector Xvector
Extract x-vectors and i-vectors with Kaldi
Stars: ✭ 67 (-70.48%)
Mutual labels:  kaldi
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (-39.21%)
Mutual labels:  speech
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-71.81%)
Mutual labels:  speech
Timit
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Stars: ✭ 202 (-11.01%)
Mutual labels:  speech
Nhyai
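AI-powered content moderation supporting pornography detection, violence and terrorism detection, language identification, sensitive-text detection, and video detection, plus a range of OCR capabilities such as recognition of ID cards, driver's licenses, vehicle licenses, business licenses, bank cards, handwriting, license plates, and business cards. The features can be tried on the project website.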
Stars: ✭ 60 (-73.57%)
Mutual labels:  kaldi
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-40.53%)
Mutual labels:  speech
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-74.89%)
Mutual labels:  speech
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (-22.91%)
Mutual labels:  speech
Stl
The ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-80.62%)
Mutual labels:  speech
Avpi
Open-source voice command macro software
Stars: ✭ 130 (-42.73%)
Mutual labels:  speech
Dialectid e2e
End to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-82.38%)
Mutual labels:  speech
Speech Enhancement
Deep learning for audio denoising
Stars: ✭ 207 (-8.81%)
Mutual labels:  speech
Voxceleb Ivector
Voxceleb1 i-vector based speaker recognition system
Stars: ✭ 36 (-84.14%)
Mutual labels:  kaldi
Asr audio data links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-43.61%)
Mutual labels:  speech
Theano Kaldi Rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
Stars: ✭ 31 (-86.34%)
Mutual labels:  kaldi
Deep speaker Speaker recognition system
Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)
Stars: ✭ 174 (-23.35%)
Mutual labels:  speech
Kaldi Io
c++ Kaldi IO lib (static and dynamic).
Stars: ✭ 22 (-90.31%)
Mutual labels:  kaldi
Audiomate
Python library for handling audio datasets.
Stars: ✭ 99 (-56.39%)
Mutual labels:  speech
Pocketsphinx Python
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+31.28%)
Mutual labels:  speech
Sednn
Deep-learning-based speech enhancement using Keras or PyTorch, designed to be easy to use
Stars: ✭ 288 (+26.87%)
Mutual labels:  speech
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+2638.33%)
Mutual labels:  speech
Code Switching Papers
A curated list of research papers and resources on code-switching
Stars: ✭ 122 (-46.26%)
Mutual labels:  speech
Segan
Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+191.19%)
Mutual labels:  speech
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+755.51%)
Mutual labels:  speech
Factorized Tdnn
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-56.83%)
Mutual labels:  kaldi
Vosk Server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+22.03%)
Mutual labels:  kaldi
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+174.01%)
Mutual labels:  speech
Speech And Text Unity Ios Android
Speech to text in Unity iOS using native speech recognition
Stars: ✭ 117 (-48.46%)
Mutual labels:  speech
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+134.36%)
Mutual labels:  speech
Volute
Raspberry Pi + Nodejs = Speech Robot
Stars: ✭ 224 (-1.32%)
Mutual labels:  speech
Montreal Forced Aligner
Command line utility for forced alignment using Kaldi
Stars: ✭ 490 (+115.86%)
Mutual labels:  kaldi
Tfg Voice Conversion
Deep Learning-based Voice Conversion system
Stars: ✭ 115 (-49.34%)
Mutual labels:  speech
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+115.86%)
Mutual labels:  speech
Ctc pytorch
CTC end-to-end ASR for the TIMIT and 863 corpora.
Stars: ✭ 161 (-29.07%)
Mutual labels:  kaldi
Cboard
AAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+92.51%)
Mutual labels:  speech
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-51.1%)
Mutual labels:  speech
Speechtotext Websockets Javascript
SDK & Sample to do speech recognition using websockets in Javascript
Stars: ✭ 191 (-15.86%)
Mutual labels:  speech
Voice Converter Cyclegan
Voice Converter Using CycleGAN and Non-Parallel Data
Stars: ✭ 384 (+69.16%)
Mutual labels:  speech
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+551.54%)
Mutual labels:  speech
Tts
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (+34.36%)
Mutual labels:  speech
Tts Papers
🐸 collection of TTS papers
Stars: ✭ 160 (-29.52%)
Mutual labels:  speech
Voice Builder
An open-source text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+59.47%)
Mutual labels:  speech
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+497.8%)
Mutual labels:  kaldi
Asr theory
Speech recognition theory, papers, and slides (PPT)
Stars: ✭ 344 (+51.54%)
Mutual labels:  kaldi
Neural Voice Cloning With Few Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Stars: ✭ 211 (-7.05%)
Mutual labels:  speech
Ios 10 Sampler
Code examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+1371.81%)
Mutual labels:  speech
Pytorch Kaldi Neural Speaker Embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Stars: ✭ 99 (-56.39%)
Mutual labels:  kaldi
Css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (+33.04%)
Mutual labels:  speech
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (-31.28%)
Mutual labels:  kaldi
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+30.84%)
Mutual labels:  speech
Wikipron
Massively multilingual pronunciation mining
Stars: ✭ 99 (-56.39%)
Mutual labels:  speech
React Transcript Editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Stars: ✭ 285 (+25.55%)
Mutual labels:  kaldi
Depression Detect
Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (-17.62%)
Mutual labels:  speech
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+19.38%)
Mutual labels:  kaldi
Eend
End-to-End Neural Diarization
Stars: ✭ 153 (-32.6%)
Mutual labels:  kaldi
Wavenet Enhancement
Speech Enhancement using Bayesian WaveNet
Stars: ✭ 86 (-62.11%)
Mutual labels:  speech
Plda
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
Stars: ✭ 85 (-62.56%)
Mutual labels:  kaldi
Gtts
Python library and CLI tool to interface with Google Translate's text-to-speech API
Stars: ✭ 1,303 (+474.01%)
Mutual labels:  speech
Source separation
Deep learning based speech source separation using Pytorch
Stars: ✭ 226 (-0.44%)
Mutual labels:  speech