All Projects → kaldi_helpers → Similar Projects or Alternatives

388 Open source projects that are alternatives of or similar to kaldi_helpers

MajorDomo-Scenarios
Сценарии для системы домашней автоматизации Majordomo
Stars: ✭ 12 (-7.69%)
Mutual labels:  speech
lxa5
Linguistica 5: Unsupervised Learning of Linguistic Structure
Stars: ✭ 27 (+107.69%)
Mutual labels:  computational-linguistics
yap
Yet Another (natural language) Parser
Stars: ✭ 40 (+207.69%)
Mutual labels:  computational-linguistics
citation-function
Measuring the Evolution of a Scientific Field through Citation Frames
Stars: ✭ 40 (+207.69%)
Mutual labels:  computational-linguistics
Chinese-automatic-speech-recognition
Chinese speech recognition
Stars: ✭ 147 (+1030.77%)
Mutual labels:  speech-to-text
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (+153.85%)
Mutual labels:  speech
pylangacq
Language Acquisition Research Tools
Stars: ✭ 33 (+153.85%)
Mutual labels:  computational-linguistics
ventib
📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (+230.77%)
Mutual labels:  speech
nytwit
New York Times Word Innovation Types dataset
Stars: ✭ 21 (+61.54%)
Mutual labels:  computational-linguistics
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (+515.38%)
Mutual labels:  speech
aframe-speech-controls-component
alternative form of inputs for in-VR interaction with the content of a scene
Stars: ✭ 13 (+0%)
Mutual labels:  speech
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+1138.46%)
Mutual labels:  speech
txt2speech
Convert text to speech using Google Translate API
Stars: ✭ 38 (+192.31%)
Mutual labels:  speech
hf-experiments
Experiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (+184.62%)
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (+169.23%)
Mutual labels:  speech-to-text
data-at-hand-mobile
Mobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (+284.62%)
Mutual labels:  speech
scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (+7.69%)
Mutual labels:  speech-to-text
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (+169.23%)
Mutual labels:  speech-to-text
obvi
A Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (+315.38%)
TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (+346.15%)
Mutual labels:  speech
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (+92.31%)
Mutual labels:  speech
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+2169.23%)
Mutual labels:  speech
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+438.46%)
Mutual labels:  computational-linguistics
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (+92.31%)
Mutual labels:  speech-to-text
react-native-speech-bubble
💬 A speech bubble dialog component for React Native.
Stars: ✭ 50 (+284.62%)
Mutual labels:  speech
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (+61.54%)
Mutual labels:  speech-to-text
audio noise clustering
https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (+84.62%)
Mutual labels:  speech
VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Stars: ✭ 278 (+2038.46%)
Mutual labels:  speech
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+546.15%)
Mutual labels:  speech
datalinguist
Stanford CoreNLP in idiomatic Clojure.
Stars: ✭ 93 (+615.38%)
Mutual labels:  computational-linguistics
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+700%)
Mutual labels:  kaldi
benchmarkstt
Open Source AI Benchmarking toolkit for benchmarking speech to text services
Stars: ✭ 43 (+230.77%)
Mutual labels:  speech-to-text
Naver-AI-Hackathon-Speech
2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib
Stars: ✭ 26 (+100%)
Mutual labels:  speech
browser-apis
🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (+61.54%)
Mutual labels:  speech
Shifter
Pitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (+69.23%)
Mutual labels:  speech
lectures-all
Central repository for all lectures on deep learning at UPC ETSETB TelecomBCN.
Stars: ✭ 46 (+253.85%)
Mutual labels:  speech
glaemscribe
Glaemscribe, the tolkienian languages/writings transcription engine.
Stars: ✭ 29 (+123.08%)
Mutual labels:  transcription
esapp
An unsupervised Chinese word segmentation tool.
Stars: ✭ 13 (+0%)
Mutual labels:  computational-linguistics
Voice Gender
Gender recognition by voice and speech analysis
Stars: ✭ 248 (+1807.69%)
Mutual labels:  speech
embedding evaluation
Evaluate your word embeddings
Stars: ✭ 32 (+146.15%)
Mutual labels:  computational-linguistics
Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+1784.62%)
Mutual labels:  speech
Unity live caption
Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (+100%)
Mutual labels:  speech-to-text
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+18238.46%)
ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
Stars: ✭ 58 (+346.15%)
Mutual labels:  computational-linguistics
TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (+400%)
Mutual labels:  speech
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+1761.54%)
Mutual labels:  speech
Gcc Nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Stars: ✭ 231 (+1676.92%)
Mutual labels:  speech
wave2vec-recognize-docker
Wave2vec 2.0 Recognize pipeline
Stars: ✭ 30 (+130.77%)
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+192.31%)
Mutual labels:  speech-to-text
Source separation
Deep learning based speech source separation using Pytorch
Stars: ✭ 226 (+1638.46%)
Mutual labels:  speech
Volute
Raspberry Pi + Nodejs = Speech Robot
Stars: ✭ 224 (+1623.08%)
Mutual labels:  speech
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (+130.77%)
Mutual labels:  speech-to-text
Speech Denoiser
A speech denoise lv2 plugin based on RNNoise library
Stars: ✭ 220 (+1592.31%)
Mutual labels:  speech
Speech Enhancement
Deep learning for audio denoising
Stars: ✭ 207 (+1492.31%)
Mutual labels:  speech
vspeech
📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜
Stars: ✭ 38 (+192.31%)
Mutual labels:  speech-to-text
Phomeme
Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (+38.46%)
Mutual labels:  speech
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+1000%)
Mutual labels:  speech
Tts Cube
End-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (+1538.46%)
Mutual labels:  speech
linguistics problems
Natural language processing in examples and games
Stars: ✭ 23 (+76.92%)
Mutual labels:  computational-linguistics
Neural Voice Cloning With Few Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Stars: ✭ 211 (+1523.08%)
Mutual labels:  speech
61-120 of 388 similar projects