388 open-source projects that are alternatives to or similar to kaldi_helpers

AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+284.62%)
Mutual labels:  speech-to-text
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+307.69%)
Mutual labels:  speech-to-text
Tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+13407.69%)
Mutual labels:  speech
K6nele
An Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+1407.69%)
Mutual labels:  speech-to-text
hf-experiments
Experiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (+184.62%)
Nemo
NeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+28246.15%)
Mutual labels:  speech-to-text
Stl
The ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (+238.46%)
Mutual labels:  speech
obvi
A Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (+315.38%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+65746.15%)
Mutual labels:  speech-to-text
Dialectid e2e
End to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (+207.69%)
Mutual labels:  speech
Dictate.js
A small JavaScript library for browser-based real-time speech recognition. It uses Recorderjs for audio capture and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+1400%)
Mutual labels:  speech-to-text
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (+961.54%)
Mutual labels:  speech
Expressive tacotron
Tensorflow Implementation of Expressive Tacotron
Stars: ✭ 192 (+1376.92%)
Mutual labels:  speech-to-text
Wsay
Windows "say"
Stars: ✭ 36 (+176.92%)
Mutual labels:  speech
TF-Speech-Recognition-Challenge-Solution
Source code of the model used in the TensorFlow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in the top 5% on the private leaderboard.
Stars: ✭ 58 (+346.15%)
Mutual labels:  speech
Athena
A free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity
Stars: ✭ 73 (+461.54%)
Mutual labels:  speech-to-text
Automatic Speech Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Stars: ✭ 192 (+1376.92%)
Mutual labels:  speech-to-text
eidos-audition
Collection of auditory models.
Stars: ✭ 25 (+92.31%)
Mutual labels:  speech
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (+969.23%)
Mutual labels:  speech
Voice Overlay Android
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+1353.85%)
Mutual labels:  speech-to-text
Praat
Praat: Doing Phonetics By Computer
Stars: ✭ 675 (+5092.31%)
Mutual labels:  speech
digital-paper-edit-client
Work in progress - BBC News Labs digital paper edit project - React Client
Stars: ✭ 36 (+176.92%)
Mutual labels:  speech-to-text
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+2169.23%)
Mutual labels:  speech
dataflow-contact-center-speech-analysis
Speech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create analytics.
Stars: ✭ 46 (+253.85%)
Mutual labels:  speech-to-text
Speech Emotion Analyzer
A neural network model capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+4769.23%)
Mutual labels:  speech
revai-node-sdk
Node.js SDK for the Rev AI API
Stars: ✭ 21 (+61.54%)
Mutual labels:  speech-to-text
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+438.46%)
Mutual labels:  computational-linguistics
ArabicProcessingCog
A Python package that does stemming, tokenization, sentence breaking, segmentation, normalization, and POS tagging for the Arabic language.
Stars: ✭ 19 (+46.15%)
Mutual labels:  computational-linguistics
Tensorflow Speech Recognition
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
Stars: ✭ 2,118 (+16192.31%)
Mutual labels:  speech-to-text
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (+61.54%)
Mutual labels:  speech-to-text
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (+938.46%)
Mutual labels:  speech
Speaker adapted tts
Making a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (+1307.69%)
Mutual labels:  speech-to-text
sembei
🍘 Word embeddings without word segmentation 🍘
Stars: ✭ 14 (+7.69%)
Mutual labels:  computational-linguistics
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+3869.23%)
Mutual labels:  speech
SentimentAnalysis
Sentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (+146.15%)
Mutual labels:  computational-linguistics
audio noise clustering
https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (+84.62%)
Mutual labels:  speech
sentiment-analysis-of-tweets-in-russian
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.
Stars: ✭ 51 (+292.31%)
Mutual labels:  computational-linguistics
datastories-semeval2017-task6
Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (+53.85%)
Mutual labels:  computational-linguistics
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+546.15%)
Mutual labels:  speech
Kaldi Onnx
Kaldi model converter to ONNX
Stars: ✭ 174 (+1238.46%)
Mutual labels:  kaldi
Cboard
AAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+3261.54%)
Mutual labels:  speech
Kaldiio
A pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+1130.77%)
Mutual labels:  kaldi
datalinguist
Stanford CoreNLP in idiomatic Clojure.
Stars: ✭ 93 (+615.38%)
Mutual labels:  computational-linguistics
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+1100%)
Mutual labels:  kaldi
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+3038.46%)
Mutual labels:  speech
browser-apis
🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (+61.54%)
Mutual labels:  speech
Vosk
VOSK Speech Recognition Toolkit
Stars: ✭ 182 (+1300%)
Mutual labels:  speech-to-text
Elpis
🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (+676.92%)
Mutual labels:  kaldi
Voice Converter Cyclegan
Voice Converter Using CycleGAN and Non-Parallel Data
Stars: ✭ 384 (+2853.85%)
Mutual labels:  speech
Factorized Tdnn
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (+653.85%)
Mutual labels:  kaldi
Shifter
Pitch shifter using WSOLA and resampling, implemented in Python 3
Stars: ✭ 22 (+69.23%)
Mutual labels:  speech
Voice Builder
An open-source text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+2684.62%)
Mutual labels:  speech
srvk-eesen-offline-transcriber
Top level code to transcribe English audio/video files into text/subtitles
Stars: ✭ 22 (+69.23%)
Mutual labels:  kaldi
torchain
WIP: PyTorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice-Free MMI)
Stars: ✭ 20 (+53.85%)
Mutual labels:  kaldi
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+6369.23%)
Mutual labels:  speech-to-text
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (+246.15%)
Mutual labels:  speech
2018-dlsl
UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (+38.46%)
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+492.31%)
Mutual labels:  speech
Voice activity detection
Voice Activity Detection based on Deep Learning & TensorFlow
Stars: ✭ 132 (+915.38%)
Mutual labels:  speech
Deepspeech Server
A testing server for a speech-to-text service based on Mozilla DeepSpeech
Stars: ✭ 176 (+1253.85%)
Mutual labels:  speech-to-text