Lip Reading Deeplearning🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Stars: ✭ 1,641 (+1008.78%)
UnityandroidspeechrecognitionThis repository is a Unity plugin for Android Speech Recognition (based on Java implementation)
Stars: ✭ 73 (-50.68%)
Rnn NluA TensorFlow implementation of Recurrent Neural Networks for Sequence Classification and Sequence Labeling
Stars: ✭ 463 (+212.84%)
RhasspyOffline private voice assistant for many human languages
Stars: ✭ 458 (+209.46%)
Sarcasm DetectionDetecting Sarcasm on Twitter using both traditonal machine learning and deep learning techniques.
Stars: ✭ 73 (-50.68%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+165.54%)
Patterspeech-to-text in pytorch
Stars: ✭ 71 (-52.03%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+164.86%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-30.41%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+152.7%)
Torch AcRecurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (-52.7%)
Midi RnnGenerate monophonic melodies with machine learning using a basic LSTM RNN
Stars: ✭ 124 (-16.22%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+153.38%)
Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-31.08%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+2962.84%)
Deep PlantDeep-Plant: Plant Classification with CNN/RNN. It consists of CAFFE/Tensorflow implementation of our PR-17, TIP-18 (HGO-CNN & PlantStructNet) and MalayaKew dataset.
Stars: ✭ 66 (-55.41%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+139.19%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (+131.76%)
DogtorchWho Let The Dogs Out? Modeling Dog Behavior From Visual Data https://arxiv.org/pdf/1803.10827.pdf
Stars: ✭ 66 (-55.41%)
Personality DetectionImplementation of a hierarchical CNN based model to detect Big Five personality traits
Stars: ✭ 338 (+128.38%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+831.08%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+12521.62%)
Rcnn Relation ExtractionTensorflow Implementation of Recurrent Convolutional Neural Network for Relation Extraction
Stars: ✭ 64 (-56.76%)
Alan Sdk IosAlan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (+114.86%)
Theano lstm🔬 Nano size Theano LSTM module
Stars: ✭ 310 (+109.46%)
Predrnn PytorchOfficial implementation for NIPS'17 paper: PredRNN: Recurrent Neural Networks for Predictive Learning Using Spatiotemporal LSTMs.
Stars: ✭ 59 (-60.14%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+816.89%)
Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+101.35%)
Ios mlList of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Stars: ✭ 1,409 (+852.03%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (+202.7%)
Alan Sdk IonicAlan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (+93.92%)
ParaphraserSentence paraphrase generation at the sentence level
Stars: ✭ 283 (+91.22%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+87.16%)
RnnsharpRNNSharp is a toolkit of deep recurrent neural network which is widely used for many different kinds of tasks, such as sequence labeling, sequence-to-sequence and so on. It's written by C# language and based on .NET framework 4.6 or above versions. RNNSharp supports many different types of networks, such as forward and bi-directional network, sequence-to-sequence network, and different types of layers, such as LSTM, Softmax, sampled Softmax and others.
Stars: ✭ 277 (+87.16%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-61.49%)
Wer are weAttempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Stars: ✭ 1,684 (+1037.84%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+83.11%)
BiglittlenetOfficial repository for Big-Little Net
Stars: ✭ 57 (-61.49%)
Lstm Human Activity RecognitionHuman Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier
Stars: ✭ 2,943 (+1888.51%)
Mad TwinnetThe code for the MaD TwinNet. Demo page:
Stars: ✭ 99 (-33.11%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+197.3%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+899.32%)
Rnn TrajmodelThe source of the IJCAI2017 paper "Modeling Trajectory with Recurrent Neural Networks"
Stars: ✭ 72 (-51.35%)