deep-char-cnn-lstmDeep Character CNN LSTM Encoder with Classification and Similarity Models
Stars: ✭ 20 (-60%)
TF-Model-Deploy-TutorialA tutorial exploring multiple approaches to deploy a trained TensorFlow (or Keras) model or multiple models for prediction.
Stars: ✭ 51 (+2%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-26%)
GestureAIRNN(Recurrent Nerural network) model which recognize hand-gestures drawing 5 figures.
Stars: ✭ 20 (-60%)
extkerasPlayground for implementing custom layers and other components compatible with keras, with the purpose to learn the framework better and perhaps in future offer some utils for others.
Stars: ✭ 18 (-64%)
digit-recognizer-liveRecognize Digits using Deep Neural Networks in Google Chrome live!
Stars: ✭ 29 (-42%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+64%)
keras-deep-learningVarious implementations and projects on CNN, RNN, LSTM, GAN, etc
Stars: ✭ 22 (-56%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (+108%)
OCROptical character recognition Using Deep Learning
Stars: ✭ 25 (-50%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-42%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (+178%)
fall-detection-two-stream-cnnReal-time fall detection using two-stream convolutional neural net (CNN) with Motion History Image (MHI)
Stars: ✭ 49 (-2%)
datastories-semeval2017-task6Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-60%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+52%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (+42%)
lstm-mathNeural network that solves math equations on the character level
Stars: ✭ 26 (-48%)
tf-faster-rcnnTensorflow 2 Faster-RCNN implementation from scratch supporting to the batch processing with MobileNetV2 and VGG16 backbones
Stars: ✭ 88 (+76%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (+84%)
keras-complexKeras-Tensorflow implementation of complex-valued convolutional neural networks
Stars: ✭ 96 (+92%)
ConvLSTM-PyTorchConvLSTM/ConvGRU (Encoder-Decoder) with PyTorch on Moving-MNIST
Stars: ✭ 202 (+304%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (+126%)
WaveGrad2PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Stars: ✭ 55 (+10%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-48%)
theano-recurrenceRecurrent Neural Networks (RNN, GRU, LSTM) and their Bidirectional versions (BiRNN, BiGRU, BiLSTM) for word & character level language modelling in Theano
Stars: ✭ 40 (-20%)
mongolian-nlpUseful resources for Mongolian NLP
Stars: ✭ 119 (+138%)
bert-squeeze🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Stars: ✭ 56 (+12%)
ntua-slp-semeval2018Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.
Stars: ✭ 79 (+58%)
deep-transTransliterating English to Hindi using Recurrent Neural Networks
Stars: ✭ 44 (-12%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+310%)
AttentionRepository for Attention Algorithm
Stars: ✭ 39 (-22%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1582%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-52%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-28%)
CS231nPyTorch/Tensorflow solutions for Stanford's CS231n: "CNNs for Visual Recognition"
Stars: ✭ 47 (-6%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-30%)
SentimentAnalysis(BOW, TF-IDF, Word2Vec, BERT) Word Embeddings + (SVM, Naive Bayes, Decision Tree, Random Forest) Base Classifiers + Pre-trained BERT on Tensorflow Hub + 1-D CNN and Bi-Directional LSTM on IMDB Movie Reviews Dataset
Stars: ✭ 40 (-20%)
AESRC2020a deep accent recognition network
Stars: ✭ 35 (-30%)
keras tfrecordExtending Keras to support tfrecord dataset
Stars: ✭ 61 (+22%)
Show and TellShow and Tell : A Neural Image Caption Generator
Stars: ✭ 74 (+48%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-46%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+4668%)
pytorch audioaudio processing module for pytorch:stft, istft
Stars: ✭ 33 (-34%)
SpectrumSpectrum is an AI that uses machine learning to generate Rap song lyrics
Stars: ✭ 37 (-26%)