All Projects → Vad → Similar Projects or Alternatives

1475 Open source projects that are alternatives of or similar to Vad

Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+237.14%)
Mutual labels:  lstm, speech-recognition, speech, dnn
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-79.42%)
Mutual labels:  data, speech-recognition, speech
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-34.41%)
Mutual labels:  speech-recognition, attention, speech
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-21.22%)
Mutual labels:  speech-recognition, speech
Image Caption Generator
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Stars: ✭ 126 (-79.74%)
Mutual labels:  lstm, attention
Tts Cube
End-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (-65.76%)
Mutual labels:  lstm, speech
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-71.22%)
Mutual labels:  speech, speech-recognition
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-67.04%)
Mutual labels:  speech, speech-recognition
QTextRecognizer
A gui for tesseractOCR with some preprocessing image options (OpenCV) for improve character recognition
Stars: ✭ 27 (-95.66%)
Mutual labels:  dnn, lstm
learningspoons
nlp lecture-notes and source code
Stars: ✭ 29 (-95.34%)
Mutual labels:  lstm, attention
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-63.99%)
Mutual labels:  speech, speech-recognition
Nlp Models Tensorflow
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Stars: ✭ 1,603 (+157.72%)
Mutual labels:  lstm, attention
Chinese Chatbot
中文聊天机器人,基于10万组对白训练而成,采用注意力机制,对一般问题都会生成一个有意义的答复。已上传模型,可直接运行,跑不起来直播吃键盘。
Stars: ✭ 124 (-80.06%)
Mutual labels:  lstm, attention
Datastories Semeval2017 Task4
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (-70.42%)
Mutual labels:  lstm, attention
Multimodal Sentiment Analysis
Attention-based multimodal fusion for sentiment analysis
Stars: ✭ 172 (-72.35%)
Mutual labels:  lstm, attention
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-86.5%)
Mutual labels:  speech, speech-recognition
TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (-90.68%)
Mutual labels:  speech, speech-recognition
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-85.69%)
Mutual labels:  speech, speech-recognition
Pytorch Seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
Stars: ✭ 3,418 (+449.52%)
Mutual labels:  lstm, attention
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-14.47%)
Mutual labels:  speech-recognition, speech
dnn-lstm-word-segment
Chinese Word Segmention Base on the Deep Learning and LSTM Neural Network
Stars: ✭ 24 (-96.14%)
Mutual labels:  dnn, lstm
Speech-Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-96.62%)
Mutual labels:  lstm, speech-recognition
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-94.37%)
Mutual labels:  speech, speech-recognition
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Stars: ✭ 50 (-91.96%)
Mutual labels:  lstm, speech-recognition
Base-On-Relation-Method-Extract-News-DA-RNN-Model-For-Stock-Prediction--Pytorch
基於關聯式新聞提取方法之雙階段注意力機制模型用於股票預測
Stars: ✭ 33 (-94.69%)
Mutual labels:  lstm, attention
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-80.23%)
Mutual labels:  speech, speech-recognition
Numpy Ml
Machine learning, in numpy
Stars: ✭ 11,100 (+1684.57%)
Mutual labels:  lstm, attention
Cnn lstm for text classify
CNN, LSTM, NBOW, fasttext 中文文本分类
Stars: ✭ 90 (-85.53%)
Mutual labels:  lstm, attention
Self Attention Classification
document classification using LSTM + self attention
Stars: ✭ 84 (-86.5%)
Mutual labels:  lstm, attention
voice-conversion
an tutorial implement of voice conversion using pytorch
Stars: ✭ 26 (-95.82%)
Mutual labels:  dnn, lstm
Crnn attention ocr chinese
CRNN with attention to do OCR,add Chinese recognition
Stars: ✭ 315 (-49.36%)
Mutual labels:  lstm, attention
Deep Time Series Prediction
Seq2Seq, Bert, Transformer, WaveNet for time series prediction.
Stars: ✭ 183 (-70.58%)
Mutual labels:  lstm, attention
Rnn For Joint Nlu
Pytorch implementation of "Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling" (https://arxiv.org/abs/1609.01454)
Stars: ✭ 176 (-71.7%)
Mutual labels:  lstm, attention
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (-64.63%)
Mutual labels:  lstm, speech-recognition
Machine Learning
My Attempt(s) In The World Of ML/DL....
Stars: ✭ 78 (-87.46%)
Mutual labels:  lstm, attention
Lhotse
Stars: ✭ 236 (-62.06%)
Mutual labels:  data, speech
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-91.48%)
Mutual labels:  speech, speech-recognition
Automatic speech recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 2,751 (+342.28%)
Mutual labels:  lstm, speech-recognition
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-96.62%)
Mutual labels:  speech, speech-recognition
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-96.62%)
Mutual labels:  speech, speech-recognition
EBIM-NLI
Enhanced BiLSTM Inference Model for Natural Language Inference
Stars: ✭ 24 (-96.14%)
Mutual labels:  lstm, attention
Pointer Networks Experiments
Sorting numbers with pointer networks
Stars: ✭ 53 (-91.48%)
Mutual labels:  lstm, attention
LearningMetersPoems
Official repo of the article: Yousef, W. A., Ibrahime, O. M., Madbouly, T. M., & Mahmoud, M. A. (2019), "Learning meters of arabic and english poems with recurrent neural networks: a step forward for language understanding and synthesis", arXiv preprint arXiv:1905.05700
Stars: ✭ 18 (-97.11%)
Mutual labels:  dnn, lstm
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-86.82%)
Mutual labels:  speech, speech-recognition
iPerceive
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021
Stars: ✭ 52 (-91.64%)
Mutual labels:  lstm, attention
datastories-semeval2017-task6
Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-96.78%)
Mutual labels:  lstm, attention
ntua-slp-semeval2018
Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.
Stars: ✭ 79 (-87.3%)
Mutual labels:  lstm, attention
automatic-personality-prediction
[AAAI 2020] Modeling Personality with Attentive Networks and Contextual Embeddings
Stars: ✭ 43 (-93.09%)
Mutual labels:  lstm, attention
myDL
Deep Learning
Stars: ✭ 18 (-97.11%)
Mutual labels:  dnn, lstm
Hierarchical-Word-Sense-Disambiguation-using-WordNet-Senses
Word Sense Disambiguation using Word Specific models, All word models and Hierarchical models in Tensorflow
Stars: ✭ 33 (-94.69%)
Mutual labels:  lstm, attention
speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-90.19%)
Mutual labels:  dnn, speech-recognition
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-91.64%)
Mutual labels:  speech, speech-recognition
Pocketsphinx Python
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-52.09%)
Mutual labels:  speech-recognition, speech
dhs summit 2019 image captioning
Image captioning using attention models
Stars: ✭ 34 (-94.53%)
Mutual labels:  lstm, attention
Time Attention
Implementation of RNN for Time Series prediction from the paper https://arxiv.org/abs/1704.02971
Stars: ✭ 52 (-91.64%)
Mutual labels:  lstm, attention
Text Classification Keras
📚 Text classification library with Keras
Stars: ✭ 53 (-91.48%)
Mutual labels:  lstm, attention
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-97.75%)
Mutual labels:  speech, speech-recognition
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (-74.6%)
Mutual labels:  speech, dnn
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-36.82%)
Mutual labels:  speech-recognition, speech
Specaugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-34.41%)
Mutual labels:  speech-recognition, speech
1-60 of 1475 similar projects