All Projects → Nodejs Speech → Similar Projects or Alternatives

304 Open source projects that are alternatives of or similar to Nodejs Speech

Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (-55.05%)
Mutual labels:  speech, speech-to-text
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-10.09%)
Mutual labels:  speech, speech-to-text
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-67.16%)
Mutual labels:  speech, speech-to-text
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-76.51%)
Mutual labels:  speech, speech-to-text
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (-55.6%)
Mutual labels:  speech, speech-to-text
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+1040.55%)
Mutual labels:  speech, speech-to-text
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-77.43%)
Mutual labels:  speech, speech-to-text
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-97.61%)
Mutual labels:  speech, speech-to-text
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+123.67%)
Mutual labels:  speech, speech-to-text
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (-62.39%)
Mutual labels:  speech, speech-to-text
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-27.89%)
Mutual labels:  speech, speech-to-text
Css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (-44.59%)
Mutual labels:  speech, speech-to-text
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+86.61%)
Mutual labels:  speech, speech-to-text
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (-69.72%)
Mutual labels:  speech, speech-to-text
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (+92.29%)
Mutual labels:  speech, speech-to-text
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-87.34%)
Mutual labels:  speech, speech-to-text
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-93.58%)
Mutual labels:  speech, speech-to-text
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-96.15%)
Mutual labels:  speech, speech-to-text
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-62.39%)
Mutual labels:  speech, speech-to-text
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-89.54%)
Mutual labels:  speech, speech-to-text
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-88.26%)
Mutual labels:  speech, speech-to-text
Lingvo
Lingvo
Stars: ✭ 2,361 (+333.21%)
Mutual labels:  speech, speech-to-text
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+1946.06%)
Mutual labels:  speech, speech-to-text
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-83.67%)
Mutual labels:  speech, speech-to-text
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-97.43%)
Mutual labels:  speech, speech-to-text
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-93.58%)
Mutual labels:  speech, speech-to-text
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-90.28%)
Mutual labels:  speech, speech-to-text
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-84.95%)
Mutual labels:  speech, speech-to-text
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-2.39%)
Mutual labels:  speech, speech-to-text
Sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (-47.16%)
Mutual labels:  speech
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (-49.17%)
Mutual labels:  speech-to-text
Speech Vad Demo
集成Webrtc的VAD,用于切分音频文件
Stars: ✭ 259 (-52.48%)
Mutual labels:  speech
Speech Demo
语音api示例
Stars: ✭ 454 (-16.7%)
Mutual labels:  speech-to-text
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-29.72%)
Mutual labels:  speech-to-text
Speech Aligner
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (-52.48%)
Mutual labels:  speech
Amazing Python Scripts
🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Stars: ✭ 229 (-57.98%)
Mutual labels:  speech
Voice Converter Cyclegan
Voice Converter Using CycleGAN and Non-Parallel Data
Stars: ✭ 384 (-29.54%)
Mutual labels:  speech
Noise2Noise-audio denoising without clean training data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-91.01%)
Mutual labels:  speech
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-83.85%)
Mutual labels:  speech
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (-9.54%)
Mutual labels:  speech
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-19.27%)
Mutual labels:  speech-to-text
Tts
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (-44.04%)
Mutual labels:  speech
minutes
🔭 Speaker diarization via transfer learning
Stars: ✭ 25 (-95.41%)
Mutual labels:  speech
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-95.96%)
Mutual labels:  speech-to-text
Voice Builder
An opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (-33.58%)
Mutual labels:  speech
flite-go
Go bindings for Flite (festival-lite)
Stars: ✭ 14 (-97.43%)
Mutual labels:  speech
Cboard
AAC communication system with text-to-speech for the browser
Stars: ✭ 437 (-19.82%)
Mutual labels:  speech
Inaspeechsegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Stars: ✭ 352 (-35.41%)
Mutual labels:  speech
kim-voice-assistant
Kim,你的私人语音助理。
Stars: ✭ 70 (-87.16%)
Mutual labels:  speech-to-text
tt-vae-gan
Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Stars: ✭ 37 (-93.21%)
Mutual labels:  speech
Autoedit 2
Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
Stars: ✭ 343 (-37.06%)
Mutual labels:  speech-to-text
BangalASR
Transformer based Bangla Speech Recognition
Stars: ✭ 20 (-96.33%)
Mutual labels:  speech-to-text
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-4.22%)
Mutual labels:  speech-to-text
Speech256
An FPGA implementation of a classic 80ies speech synthesizer. Done for the Retro Challenge 2017/10.
Stars: ✭ 51 (-90.64%)
Mutual labels:  speech
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+806.97%)
Mutual labels:  speech-to-text
Tts
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+895.78%)
Mutual labels:  speech
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-86.42%)
Mutual labels:  speech
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-96.51%)
Mutual labels:  speech-to-text
Ios 10 Sampler
Code examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+513.03%)
Mutual labels:  speech
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+1470.64%)
Mutual labels:  speech-to-text
1-60 of 304 similar projects