304 Open source projects that are alternatives of or similar to Nodejs Speech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (-55.05%)

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-10.09%)

Mutual labels: speech, speech-to-text

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-67.16%)

Mutual labels: speech, speech-to-text

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-76.51%)

Mutual labels: speech, speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (-55.6%)

Mutual labels: speech, speech-to-text

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+1040.55%)

Mutual labels: speech, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-77.43%)

Mutual labels: speech, speech-to-text

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-97.61%)

Mutual labels: speech, speech-to-text

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+123.67%)

Mutual labels: speech, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. (A trained model is available under releases.)

Stars: ✭ 205 (-62.39%)

Mutual labels: speech, speech-to-text

Awesome Kaldi

A list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)

Stars: ✭ 393 (-27.89%)

Mutual labels: speech, speech-to-text

Css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Stars: ✭ 302 (-44.59%)

Mutual labels: speech, speech-to-text

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (+86.61%)

Mutual labels: speech, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-69.72%)

Mutual labels: speech, speech-to-text

Soloud

Free, easy, portable audio engine for games

Stars: ✭ 1,048 (+92.29%)

Mutual labels: speech, speech-to-text

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (-87.34%)

Mutual labels: speech, speech-to-text

Discordspeechbot

A speech-to-text bot for Discord with music commands and more, built with Node.js. Ideal for controlling your Discord server with voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (-93.58%)

Mutual labels: speech, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-96.15%)

Mutual labels: speech, speech-to-text

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Stars: ✭ 205 (-62.39%)

Mutual labels: speech, speech-to-text

Syn Speech

Syn.Speech is a flexible, speaker-independent, continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (-89.54%)

Mutual labels: speech, speech-to-text

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-88.26%)

Mutual labels: speech, speech-to-text

Lingvo

Stars: ✭ 2,361 (+333.21%)

Mutual labels: speech, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+1946.06%)

Mutual labels: speech, speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (-83.67%)

Mutual labels: speech, speech-to-text

kaldi ag training

Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-97.43%)

Mutual labels: speech, speech-to-text

speech to text

How to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-93.58%)

Mutual labels: speech, speech-to-text

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-90.28%)

Mutual labels: speech, speech-to-text

deepspeech.mxnet

An MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-84.95%)

Mutual labels: speech, speech-to-text

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-2.39%)

Mutual labels: speech, speech-to-text

Sednn

Deep learning-based speech enhancement using Keras or PyTorch, made easy to use

Stars: ✭ 288 (-47.16%)

Mutual labels: speech

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (-49.17%)

Mutual labels: speech-to-text

Speech Vad Demo

Integrates WebRTC's VAD (voice activity detection) for splitting audio files

Stars: ✭ 259 (-52.48%)

Mutual labels: speech

Speech Demo

Speech API examples

Stars: ✭ 454 (-16.7%)

Mutual labels: speech-to-text

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-29.72%)

Mutual labels: speech-to-text

Speech Aligner

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription

Stars: ✭ 259 (-52.48%)

Mutual labels: speech

Amazing Python Scripts

🚀 Curated collection of amazing Python scripts, from basic to advanced, including automation task scripts.

Stars: ✭ 229 (-57.98%)

Mutual labels: speech

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (-29.54%)

Mutual labels: speech

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-91.01%)

Mutual labels: speech

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-83.85%)

Mutual labels: speech

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (-9.54%)

Mutual labels: speech

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (-19.27%)

Mutual labels: speech-to-text

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (-44.04%)

Mutual labels: speech

minutes

🔭 Speaker diarization via transfer learning

Stars: ✭ 25 (-95.41%)

Mutual labels: speech

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-95.96%)

Mutual labels: speech-to-text

Voice Builder

An open-source text-to-speech (TTS) voice building tool

Stars: ✭ 362 (-33.58%)

Mutual labels: speech

flite-go

Go bindings for Flite (festival-lite)

Stars: ✭ 14 (-97.43%)

Mutual labels: speech

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (-19.82%)

Mutual labels: speech

Inaspeechsegmenter

CNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (-35.41%)

Mutual labels: speech

kim-voice-assistant

Kim, your personal voice assistant.

Stars: ✭ 70 (-87.16%)

Mutual labels: speech-to-text

tt-vae-gan

Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

Stars: ✭ 37 (-93.21%)

Mutual labels: speech

Autoedit 2

Fast text-based video editing: a Node.js Electron desktop app for OS X with a Backbone front end.

Stars: ✭ 343 (-37.06%)

Mutual labels: speech-to-text

BangalASR

Transformer-based Bangla speech recognition

Stars: ✭ 20 (-96.33%)

Mutual labels: speech-to-text

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple