All Projects → Vad → Similar Projects or Alternatives

1475 Open source projects that are alternatives of or similar to Vad

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+237.14%)

Mutual labels: lstm, speech-recognition, speech, dnn

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-79.42%)

Mutual labels: data, speech-recognition, speech

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (-34.41%)

Mutual labels: speech-recognition, attention, speech

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-21.22%)

Mutual labels: speech-recognition, speech

Image Caption Generator

A neural network to generate captions for an image using CNN and RNN with BEAM Search.

Stars: ✭ 126 (-79.74%)

Mutual labels: lstm, attention

Tts Cube

End-2-end speech synthesis with recurrent neural networks

Stars: ✭ 213 (-65.76%)

Mutual labels: lstm, speech

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-71.22%)

Mutual labels: speech, speech-recognition

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (-67.04%)

Mutual labels: speech, speech-recognition

QTextRecognizer

A gui for tesseractOCR with some preprocessing image options (OpenCV) for improve character recognition

Stars: ✭ 27 (-95.66%)

Mutual labels: dnn, lstm

learningspoons

nlp lecture-notes and source code

Stars: ✭ 29 (-95.34%)

Mutual labels: lstm, attention

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-63.99%)

Mutual labels: speech, speech-recognition

Nlp Models Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Stars: ✭ 1,603 (+157.72%)

Mutual labels: lstm, attention

Chinese Chatbot

中文聊天机器人，基于10万组对白训练而成，采用注意力机制，对一般问题都会生成一个有意义的答复。已上传模型，可直接运行，跑不起来直播吃键盘。

Stars: ✭ 124 (-80.06%)

Mutual labels: lstm, attention

Datastories Semeval2017 Task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".

Stars: ✭ 184 (-70.42%)

Mutual labels: lstm, attention

Multimodal Sentiment Analysis

Attention-based multimodal fusion for sentiment analysis

Stars: ✭ 172 (-72.35%)

Mutual labels: lstm, attention

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-86.5%)

Mutual labels: speech, speech-recognition

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

Stars: ✭ 58 (-90.68%)

Mutual labels: speech, speech-recognition

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (-85.69%)

Mutual labels: speech, speech-recognition

Pytorch Seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Stars: ✭ 3,418 (+449.52%)

Mutual labels: lstm, attention

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-14.47%)

Mutual labels: speech-recognition, speech

dnn-lstm-word-segment

Chinese Word Segmention Base on the Deep Learning and LSTM Neural Network

Stars: ✭ 24 (-96.14%)

Mutual labels: dnn, lstm

Speech-Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 21 (-96.62%)

Mutual labels: lstm, speech-recognition

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-94.37%)

Mutual labels: speech, speech-recognition

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (-91.96%)

Mutual labels: lstm, speech-recognition

Base-On-Relation-Method-Extract-News-DA-RNN-Model-For-Stock-Prediction--Pytorch

基於關聯式新聞提取方法之雙階段注意力機制模型用於股票預測

Stars: ✭ 33 (-94.69%)

Mutual labels: lstm, attention

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-80.23%)

Mutual labels: speech, speech-recognition

Numpy Ml

Machine learning, in numpy

Stars: ✭ 11,100 (+1684.57%)

Mutual labels: lstm, attention

Cnn lstm for text classify

CNN, LSTM, NBOW, fasttext 中文文本分类

Stars: ✭ 90 (-85.53%)

Mutual labels: lstm, attention

Self Attention Classification

document classification using LSTM + self attention

Stars: ✭ 84 (-86.5%)

Mutual labels: lstm, attention

voice-conversion

an tutorial implement of voice conversion using pytorch

Stars: ✭ 26 (-95.82%)

Mutual labels: dnn, lstm

Crnn attention ocr chinese

CRNN with attention to do OCR,add Chinese recognition

Stars: ✭ 315 (-49.36%)

Mutual labels: lstm, attention

Deep Time Series Prediction

Seq2Seq, Bert, Transformer, WaveNet for time series prediction.

Stars: ✭ 183 (-70.58%)

Mutual labels: lstm, attention

Rnn For Joint Nlu

Pytorch implementation of "Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling" (https://arxiv.org/abs/1609.01454)

Stars: ✭ 176 (-71.7%)

Mutual labels: lstm, attention

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (-64.63%)

Mutual labels: lstm, speech-recognition

Machine Learning

My Attempt(s) In The World Of ML/DL....

Stars: ✭ 78 (-87.46%)

Mutual labels: lstm, attention

Lhotse

Stars: ✭ 236 (-62.06%)

Mutual labels: data, speech

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-91.48%)

Mutual labels: speech, speech-recognition

Automatic speech recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 2,751 (+342.28%)

Mutual labels: lstm, speech-recognition

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-96.62%)

Mutual labels: speech, speech-recognition

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-96.62%)

Mutual labels: speech, speech-recognition

EBIM-NLI

Enhanced BiLSTM Inference Model for Natural Language Inference

Stars: ✭ 24 (-96.14%)

Mutual labels: lstm, attention

Pointer Networks Experiments

Sorting numbers with pointer networks

Stars: ✭ 53 (-91.48%)

Mutual labels: lstm, attention

LearningMetersPoems

Official repo of the article: Yousef, W. A., Ibrahime, O. M., Madbouly, T. M., & Mahmoud, M. A. (2019), "Learning meters of arabic and english poems with recurrent neural networks: a step forward for language understanding and synthesis", arXiv preprint arXiv:1905.05700

Stars: ✭ 18 (-97.11%)

Mutual labels: dnn, lstm

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-86.82%)

Mutual labels: speech, speech-recognition

iPerceive

Stars: ✭ 52 (-91.64%)

Mutual labels: lstm, attention

datastories-semeval2017-task6

Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".

Stars: ✭ 20 (-96.78%)