watson-speech-translatorUse Watson Speech to Text, Language Translator, and Text to Speech in a web app with React components
Stars: ✭ 66 (+78.38%)
ruby-sdk♦️ Ruby SDK to use the IBM Watson services.
Stars: ✭ 45 (+21.62%)
speech-to-textPython helper for Google and IBM Watson speech-to-text cloud APIs.
Stars: ✭ 14 (-62.16%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (+181.08%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+9859.46%)
watson-discovery-food-reviewsCombine Watson Knowledge Studio and Watson Discovery to discover customer sentiment from product reviews
Stars: ✭ 36 (-2.7%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-27.03%)
cloudco-insuranceA modern insurance company. The application showcases cognitive and cloud computing ideas in the context of insurance.
Stars: ✭ 43 (+16.22%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+2.7%)
K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+429.73%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2172.97%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+454.05%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (+43.24%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+583.78%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (+86.49%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-43.24%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-5.41%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+427.03%)
LingvoLingvo
Stars: ✭ 2,361 (+6281.08%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+429.73%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+121.62%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+494.59%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+554.05%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+410.81%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (+72.97%)
Autoedit 2Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
Stars: ✭ 343 (+827.03%)
Node Sdk☄️ Node.js library to access IBM Watson services.
Stars: ✭ 1,471 (+3875.68%)
watson-vehicle-damage-analyzerA server and mobile app to send pictures of vehicle damage to IBM Watson Visual Recognition for classification
Stars: ✭ 62 (+67.57%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-43.24%)
scriptySpeech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-62.16%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+5624.32%)
youtube-video-maker📹 A tool for automatic video creation and uploading on YouTube
Stars: ✭ 134 (+262.16%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-18.92%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+383.78%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+140.54%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-16.22%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-62.16%)
Watson-Unity-ARKit# WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.
Stars: ✭ 24 (-35.14%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+454.05%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+35.14%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+75.68%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+62.16%)
Deepspeech ServerA testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+375.68%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+391.89%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+856.76%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-51.35%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-43.24%)