KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (-77.88%)
scriptySpeech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-87.61%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (-0.88%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (-37.17%)
timit-preprocessorExtract mfcc vectors and phones from TIMIT dataset
Stars: ✭ 14 (-87.61%)
titanium-speechUse the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop.
Stars: ✭ 21 (-81.42%)
KodiSharpUse Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
Stars: ✭ 22 (-80.53%)
wavenet-classifierKeras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (-52.21%)
praiseDo stuff with your voice in the browser.
Stars: ✭ 13 (-88.5%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-66.37%)
converseConversational text Analysis using various NLP techniques
Stars: ✭ 147 (+30.09%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+179.65%)
FAST-RIRThis is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (-20.35%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-76.99%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-87.61%)
cepCEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Stars: ✭ 140 (+23.89%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-27.43%)
specAugmentTensor2tensor experiment with SpecAugment
Stars: ✭ 46 (-59.29%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-42.48%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (-44.25%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+81.42%)
multilingual kwsFew-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Stars: ✭ 122 (+7.96%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-76.11%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (-76.99%)
EmotiW2018No description or website provided.
Stars: ✭ 83 (-26.55%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-78.76%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+123.89%)
Emotion-RecognitionEmotion recognition from EEG and physiological signals using deep neural networks
Stars: ✭ 35 (-69.03%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-55.75%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-84.07%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+119.47%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-16.81%)
AGHMNImplementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.
Stars: ✭ 25 (-77.88%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-25.66%)
emoticCode repo for the EMOTIC dataset
Stars: ✭ 93 (-17.7%)
A chronology of deep learningTracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (-58.41%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (-31.86%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+94.69%)
RECCONThis repository contains the dataset and the PyTorch implementations of the models from the paper Recognizing Emotion Cause in Conversations.
Stars: ✭ 126 (+11.5%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+120.35%)
favorite-research-papersListing my favorite research papers 📝 from different fields as I read them.
Stars: ✭ 12 (-89.38%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+114.16%)
emotion-recognition-GANThis project is a semi-supervised approach to detect emotions on faces in-the-wild using GAN
Stars: ✭ 20 (-82.3%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+3161.06%)
kaldi helpers🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-88.5%)
DragonflySpeech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Stars: ✭ 209 (+84.96%)
Emotion and Polarity SOAn emotion classifier of text containing technical content from the SE domain
Stars: ✭ 74 (-34.51%)
Openpose-based-GUI-for-Realtime-Pose-Estimate-and-Action-RecognitionGUI based on the python api of openpose in windows using cuda10 and cudnn7. Support body , hand, face keypoints estimation and data saving. Realtime gesture recognition is realized through two-layer neural network based on the skeleton collected from the gui.
Stars: ✭ 69 (-38.94%)
SpeechEmoRecSpeech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
Stars: ✭ 44 (-61.06%)