6564 open source projects that are alternatives to or similar to Silero Models

Vosk

VOSK Speech Recognition Toolkit

Stars: ✭ 182 (-65.13%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech Server

A testing server for a speech-to-text service based on Mozilla DeepSpeech

Stars: ✭ 176 (-66.28%)

Mutual labels: speech-recognition, speech-to-text

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (-63.6%)

Mutual labels: speech-recognition, asr

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-66.48%)

Mutual labels: speech-recognition, asr

Dictate.js

A small JavaScript library for browser-based real-time speech recognition. It uses Recorderjs for audio capture and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (-62.64%)

Mutual labels: speech-recognition, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-67.24%)

Mutual labels: speech-recognition, speech-to-text

Cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Stars: ✭ 249 (-52.3%)

Mutual labels: speech-recognition, asr

Zeroth

Kaldi-based open-source Korean ASR (automatic speech recognition) project

Stars: ✭ 248 (-52.49%)

Mutual labels: speech-recognition, asr

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+21.26%)

Mutual labels: jupyter-notebook, speech-recognition

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (-53.64%)

Mutual labels: speech-recognition, speech-to-text

Speech To Text Benchmark

Speech-to-text benchmark framework

Stars: ✭ 481 (-7.85%)

Mutual labels: speech-recognition, speech-to-text

Speech Emotion Recognition

Detecting emotions using MFCC features of human speech using Deep Learning

Stars: ✭ 89 (-82.95%)

Mutual labels: jupyter-notebook, speech-recognition

Glasses

High-quality Neural Networks for Computer Vision 😎

Stars: ✭ 138 (-73.56%)

Mutual labels: jupyter-notebook, pretrained-models

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+301.72%)

Mutual labels: speech-recognition, asr

syn-speech-samples

An application that demonstrates the usage of the Syn.Speech library for speech recognition

Stars: ✭ 24 (-95.4%)

Mutual labels: speech-recognition, asr

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+61.11%)

Mutual labels: speech-recognition, speech-to-text

Unity live caption

Use the Google Speech-to-Text API for real-time live-stream captioning in Unity! Best when combined with your virtual character!

Stars: ✭ 26 (-95.02%)

Mutual labels: speech-recognition, speech-to-text

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-95.98%)

Mutual labels: speech-recognition, speech-to-text

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-80.08%)

Mutual labels: speech-recognition, asr

Deep-learning-And-Paper

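[For exchange and learning purposes only] Machine intelligence: related books and classic papers, including experimental code for AutoML, sentiment classification, speech recognition, voiceprint recognition, speech synthesis, and more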

Stars: ✭ 62 (-88.12%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (-92.91%)

Mutual labels: speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-95.98%)

Mutual labels: speech-recognition, speech-to-text

voce-browser

Voice Controlled Chromium Web Browser

Stars: ✭ 34 (-93.49%)

Mutual labels: speech-recognition, speech-to-text

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (-12.64%)

Mutual labels: speech-recognition, asr

React.ai

Recognizes your speech and has a trained AI bot respond (e.g. customer service, personal assistant), using machine learning APIs (DialogFlow, apiai), speech recognition, GraphQL, Next.js, React, and Redux

Stars: ✭ 38 (-92.72%)

Mutual labels: speech-recognition, speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (-88.51%)

Mutual labels: speech-recognition, speech-to-text

speechrec

A simple speech recognition app using the Web Speech API interfaces

Stars: ✭ 18 (-96.55%)

Mutual labels: speech-recognition, speech-to-text

DeepSpeech-API

Enables users to use Mozilla's DeepSpeech model from a web browser.

Stars: ✭ 31 (-94.06%)

Mutual labels: speech-recognition, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-68.39%)

Mutual labels: speech-recognition, speech-to-text

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+356.7%)

Mutual labels: speech-recognition, asr

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (-71.84%)

Mutual labels: speech-recognition, speech-to-text

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-97.32%)

Mutual labels: speech-recognition, speech-to-text

simple diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Stars: ✭ 26 (-95.02%)

Mutual labels: speech-to-text, asr

kosr

Transformer-based Korean speech recognition

Stars: ✭ 25 (-95.21%)

Mutual labels: speech-recognition, asr

deepspeech.mxnet

An MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-84.29%)

Mutual labels: speech-recognition, speech-to-text

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (-80.08%)

Mutual labels: speech-recognition, speech-to-text

htk

HTK Toolkit with Linux 64 bit and Docker support

Stars: ✭ 14 (-97.32%)

Mutual labels: speech-recognition, speech-to-text

deepspeech

A PyTorch implementation of DeepSpeech and DeepSpeech2.

Stars: ✭ 45 (-91.38%)

Mutual labels: speech-recognition, speech-to-text

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-93.1%)

Mutual labels: speech-recognition, asr

speech to text

How to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-93.3%)

Mutual labels: speech-recognition, speech-to-text

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-90.04%)

Mutual labels: speech-recognition, asr

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+1539.85%)

Mutual labels: speech-recognition, speech-to-text

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-6.13%)

Mutual labels: speech-recognition, speech-to-text

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-96.36%)

Mutual labels: speech-recognition, asr

SpeechToText

Speech To Text in Android

Stars: ✭ 53 (-89.85%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text

Mixed-lingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-88.31%)

Mutual labels: speech-recognition, speech-to-text

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-96.36%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-97.32%)

Mutual labels: speech-recognition, speech-to-text

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-48.08%)

Mutual labels: speech-recognition, asr

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-97.32%)

Mutual labels: speech-recognition, asr

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (-46.93%)

Mutual labels: speech-recognition, speech-to-text

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-46.93%)

Mutual labels: speech-recognition, asr

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+3478.54%)

Mutual labels: speech-recognition, speech-to-text

Fbrs interactive segmentation

[CVPR2020] f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation https://arxiv.org/abs/2001.10331

Stars: ✭ 366 (-29.89%)

Mutual labels: jupyter-notebook, pretrained-models

Afinn

AFINN sentiment analysis in Python

Stars: ✭ 356 (-31.8%)

Mutual labels: english, jupyter-notebook

Zamia Speech

Open tools and data for cloudless automatic speech recognition