Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → AASHISHAG → DeepSpeech-API

AASHISHAG / DeepSpeech-API

Licence: other

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

Programming Languages

32286 projects

139335 projects - #7 most used programming language

75241 projects

184084 projects - #8 most used programming language

56736 projects

Labels

speech-recognition speech-to-text mozilla-deepspeech

Projects that are alternatives of or similar to DeepSpeech-API

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+477.42%)

Mutual labels: speech-recognition, speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (+93.55%)

Mutual labels: speech-recognition, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+70.97%)

Mutual labels: speech-recognition, speech-to-text

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-19.35%)

Mutual labels: speech-recognition, speech-to-text

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (+12.9%)

Mutual labels: speech-recognition, speech-to-text

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+1041.94%)

Mutual labels: speech-recognition, speech-to-text

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+187.1%)

Mutual labels: speech-recognition, speech-to-text

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-32.26%)

Mutual labels: speech-recognition, speech-to-text

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (+22.58%)

Mutual labels: speech-recognition, speech-to-text

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (-3.23%)

Mutual labels: speech-recognition, speech-to-text

speechmatics-python

Python library and CLI for Speechmatics

Stars: ✭ 24 (-22.58%)

Mutual labels: speech-recognition, speech-to-text

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+61.29%)

Mutual labels: speech-recognition, speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (+12.9%)

Mutual labels: speech-recognition, speech-to-text

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+561.29%)

Mutual labels: speech-recognition, speech-to-text

Voice control for your websites and applications

Stars: ✭ 53 (+70.97%)

Mutual labels: speech-recognition, speech-to-text

Rev.ai Java SDK

Stars: ✭ 16 (-48.39%)

Mutual labels: speech-recognition, speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+680.65%)

Mutual labels: speech-recognition, speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+716.13%)

Mutual labels: speech-recognition, speech-to-text

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (+122.58%)

Mutual labels: speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-32.26%)

Mutual labels: speech-recognition, speech-to-text

View All Similar Projects ➔

DeepSpeech-API

Project DeepSpeech is an open source Speech-To-Text engine, using a model trained by machine learning techniques, based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow project to make the implementation easier.

The intent of this project DeepSpeech-API is to enable the user to access DeepSpeech on a web browser. You can quickly install the dependencies on any platform (Windows/IOS/Linux) and start using it over the Web (Computer/Mobile).

Installing DeepSpeech Python bindings

$ pip3 install deepspeech

Getting the pre-trained model

If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page. Alternatively, you can run the following command to download and unzip the files in your current directory:

wget -O - https://github.com/mozilla/DeepSpeech/releases/download/v0.3.0/deepspeech-0.3.0-models.tar.gz | tar xvfz -

Runnning DeepSpeech-API

[Frontend](https://github.com/AASHISHAG/DeepSpeech-API/tree/master/frontend)

[Backend](https://github.com/AASHISHAG/DeepSpeech-API/tree/master/backend)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 31

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗