ebu / benchmarkstt

Licence: MIT license

Open Source AI Benchmarking toolkit for benchmarking speech to text services

Programming Languages

python

139335 projects - #7 most used programming language

HTML

75241 projects

Projects that are alternatives of or similar to benchmarkstt

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+411.63%)

Mutual labels: speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (-18.6%)

Mutual labels: speech-to-text

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (+60.47%)

Mutual labels: speech-to-text

Stt

🐸STT - a deep learning toolkit for Speech-to-Text, battle-tested in research and production

Stars: ✭ 197 (+358.14%)

Mutual labels: speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-51.16%)

Mutual labels: speech-to-text

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+376.74%)

Mutual labels: speech-to-text

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+355.81%)

Mutual labels: speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-51.16%)

Mutual labels: speech-to-text

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (+23.26%)

Mutual labels: speech-to-text

revai-java-sdk

Rev.ai Java SDK

Stars: ✭ 16 (-62.79%)

Mutual labels: speech-to-text

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+469.77%)

Mutual labels: speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+488.37%)

Mutual labels: speech-to-text

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+316.28%)

Mutual labels: speech-to-text

Go Astibob

Golang framework to build an AI that can understand and speak back to you, and everything else you want

Stars: ✭ 222 (+416.28%)

Mutual labels: speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (-30.23%)

Mutual labels: speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+376.74%)

Mutual labels: speech-to-text

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+723.26%)

Mutual labels: speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+106.98%)

Mutual labels: speech-to-text

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (-11.63%)

Mutual labels: speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+23.26%)

Mutual labels: speech-to-text

View All Similar Projects ➔

BenchmarkSTT

About

This is a command line tool for benchmarking Automatic Speech Recognition engines.

It is designed for non-academic production environments, and prioritises ease of use and relative benchmarking over scientific procedure and high-accuracy absolute scoring.

Because of the wide range of languages, algorithms and audio characteristics, no single STT engine can be expected to excel in all circumstances. For this reason, this tool places responsibility on the users to design their own benchmarking procedure and to decide, based on the combination of test data and metrics, which engine is best suited for their particular use case.

Usage examples

Returns the number of word insertions, deletions, replacements and matches for the hypothesis transcript compared to the reference:

benchmarkstt --reference reference.txt --hypothesis hypothesis.txt --diffcounts

Returns the Word Error Rate after lowercasing both reference and hypothesis. This normlization improves the accuracy of the Word Error Rate as it removes diffs that might otherwise be considered errors:

benchmarkstt -r reference.txt -h hypothesis.txt --wer --lowercase

Returns a visual diff after applying all the normalization rules specified in the config file:

benchmarkstt -r reference.txt -h hypothesis.txt --worddiffs --config conf

Further information

This is a collaborative project to create a library for benchmarking AI/ML applications. It was created in response to the needs of broadcasters and providers of Access Services to media organisations, but anyone is welcome to contribute. The group behind this project is the EBU's Media Information Management & AI group.

Currently the group is focussing on Speech-to-Text, but it will consider creating benchmarking tools for other AI/ML services.

For general information about this project, including the motivations and guiding principles, please see the project wiki

To install and start using the tool, go to the documentation.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

ebu / benchmarkstt

Programming Languages

Labels

Projects that are alternatives of or similar to benchmarkstt

BenchmarkSTT

About

Usage examples

Further information