JSpeakA Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features
Stars: ✭ 16 (-54.29%)
algorithmiaNo description or website provided.
Stars: ✭ 15 (-57.14%)
pred-rnnPredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs
Stars: ✭ 115 (+228.57%)
Unity-Text-to-SpeechSample app used to demonstrate the use of Microsoft Cognitive Services Text-to-Speech APIs (aka Speech Synthesis) from within Unity.
Stars: ✭ 67 (+91.43%)
plivo-pythonA Python library for communicating with the Plivo API and generating Plivo XML.
Stars: ✭ 57 (+62.86%)
bert-squeeze🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Stars: ✭ 56 (+60%)
BiLSTM-CRF-NER-PyTorchThis repo contains a PyTorch implementation of a BiLSTM-CRF model for named entity recognition task.
Stars: ✭ 109 (+211.43%)
hermes-audio-serverAn open source implementation of the audio server part of the Hermes protocol
Stars: ✭ 23 (-34.29%)
spokestack-tray-androidA UI component that makes it easy to add voice interaction to your app.
Stars: ✭ 13 (-62.86%)
DrowsyDriverDetectionThis is a project implementing Computer Vision and Deep Learning concepts to detect drowsiness of a driver and sound an alarm if drowsy.
Stars: ✭ 82 (+134.29%)
UniSpyServerAn Open source GameSpy emulator written in C#
Stars: ✭ 110 (+214.29%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (+102.86%)
ArrayLSTMGPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM..
Stars: ✭ 21 (-40%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (+2.86%)
EBIM-NLIEnhanced BiLSTM Inference Model for Natural Language Inference
Stars: ✭ 24 (-31.43%)
extkerasPlayground for implementing custom layers and other components compatible with keras, with the purpose to learn the framework better and perhaps in future offer some utils for others.
Stars: ✭ 18 (-48.57%)
LSTM-sentiment-analysisLSTM sentiment analysis. Please look at my another repo for SVM and Naive algorithem
Stars: ✭ 19 (-45.71%)
cookiecutter-flask-askCookiecutter template for Alexa skills based on the fantastic Flask-Ask framework 🍾🗣❓
Stars: ✭ 51 (+45.71%)
QuietVRA Quiet Place in VR: Generate any 3D object with your voice. It's magic!
Stars: ✭ 17 (-51.43%)
UniSpySDKUpdated and Cleaned GameSpy SDK
Stars: ✭ 31 (-11.43%)
datastories-semeval2017-task6Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-42.86%)
PhomemeSimple sentence mixing tool (work in progress)
Stars: ✭ 18 (-48.57%)
Tess4AndroidA new fork base on tess-two and Tesseract 4.0.0
Stars: ✭ 31 (-11.43%)
deep-char-cnn-lstmDeep Character CNN LSTM Encoder with Classification and Similarity Models
Stars: ✭ 20 (-42.86%)
MachineLearningImplementations of machine learning algorithm by Python 3
Stars: ✭ 16 (-54.29%)
apiSpeechly public API definitions and generated code
Stars: ✭ 15 (-57.14%)
lstm-numpyVanilla LSTM with numpy
Stars: ✭ 17 (-51.43%)
air writingOnline Hand Writing Recognition using BLSTM
Stars: ✭ 26 (-25.71%)
speaker.appSource code for https://speaker.app, a batteries-included, web-based, quasi-decentralized, WebRTC networking platform, with a primary focus on audio and screen-sharing, and a secondary focus on chat messages and peripheral features.
Stars: ✭ 26 (-25.71%)
QTextRecognizerA gui for tesseractOCR with some preprocessing image options (OpenCV) for improve character recognition
Stars: ✭ 27 (-22.86%)
voiceImplementation of the Discord Voice API for discord.js and other JS/TS libraries
Stars: ✭ 310 (+785.71%)
R UnetVideo prediction using lstm and unet
Stars: ✭ 25 (-28.57%)
MTL-AQAWhat and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (+8.57%)
SwiftyWaveSiri Waves View in Swift
Stars: ✭ 66 (+88.57%)
tf-ran-cellRecurrent Additive Networks for Tensorflow
Stars: ✭ 16 (-54.29%)
Show and TellShow and Tell : A Neural Image Caption Generator
Stars: ✭ 74 (+111.43%)
voice gender detection♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Stars: ✭ 51 (+45.71%)
SpeakerDiarization RNN CNN LSTMSpeaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).
Stars: ✭ 56 (+60%)
novel writerTrain LSTM to writer novel (HongLouMeng here) in Pytorch.
Stars: ✭ 14 (-60%)
pose2actionexperiments on classifying actions using poses
Stars: ✭ 24 (-31.43%)