All Projects → skit-ai → speech-recognition

skit-ai / speech-recognition

Licence: Apache-2.0 License
SDKs and docs for Skit's speech to text service

Programming Languages

python
139335 projects - #7 most used programming language
java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to speech-recognition

Lingvo
Lingvo
Stars: ✭ 2,361 (+11705%)
Mutual labels:  speech-recognition, speech-to-text, asr
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (+5%)
Mutual labels:  speech-recognition, speech-to-text, asr
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+925%)
Mutual labels:  speech-recognition, speech-to-text, asr
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+6685%)
Mutual labels:  speech-recognition, speech-to-text, asr
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+165%)
Mutual labels:  speech-recognition, speech-to-text, asr
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+540%)
Mutual labels:  speech-recognition, speech-to-text, asr
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (+35%)
Mutual labels:  speech-recognition, speech-to-text, asr
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (+185%)
Mutual labels:  speech-recognition, speech-to-text, asr
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+795%)
Mutual labels:  speech-recognition, speech-to-text, asr
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+925%)
Mutual labels:  speech-recognition, speech-to-text, asr
Mongolian Speech Recognition
Mongolian speech recognition with PyTorch
Stars: ✭ 97 (+385%)
Mutual labels:  speech-recognition, speech-to-text, asr
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+160%)
Mutual labels:  speech-recognition, speech-to-text, asr
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (+290%)
Mutual labels:  speech-recognition, speech-to-text, asr
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+655%)
Mutual labels:  speech-recognition, speech-to-text, asr
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (+245%)
Mutual labels:  speech-recognition, speech-to-text, asr
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (+5%)
Mutual labels:  speech-recognition, speech-to-text, asr
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+2510%)
Mutual labels:  speech-recognition, speech-to-text, asr
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+3590%)
Mutual labels:  speech-recognition, speech-to-text, asr
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+1670%)
Mutual labels:  speech-recognition, speech-to-text, asr
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (+25%)
Mutual labels:  speech-recognition, speech-to-text, asr

Speech-to-Text API

Converts audio to text

We support these ten indian languages (language codes).

  • Hindi
  • English
  • Marathi
  • Kannada
  • Malayalam
  • Bengali
  • Gujarati
  • Punjabi
  • Telugu
  • Tamil

Authentication

To get access to our APIs reach out to us at [email protected] We do not provide public access token for the APIs anymore.

Ways to use the Service

  • Transcribing short audios [audios upto 1 min]
  • Transcribing long audios [more than 1 min]
  • Transcribing audio from streaming input

We recommend that you call this service using Vernacular provided client libraries. If your application needs to call this service using your own libraries, you should use the HTTP Endpoints.

Supported SDKs: Python

REST Reference

ServiceHost: https://asr.vernacular.ai

Speech Recognition

Name Description
recognize Performs synchronous speech recognition: receive results after all audio has been sent and processed.
longrunningrecognize Performs asynchronous speech recognition. Generally used for long audios

RPC Reference

Speech Recognition

Methods Description
Recognize Performs synchronous speech recognition: receive results after all audio has been sent and processed.
LongRunningRecognize Performs asynchronous speech recognition: receive results via the longrunning.Operations interface.
StreamingRecognize Performs streaming speech recognition: receive results while sending audio. Supports both unidirectional and bidirectional streaming.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].