A speech-to-text bot for Discord with music commands and more, built with NodeJS. Ideal for controlling your Discord server with voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (-60.67%)

Mutual labels: speech, speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-76.4%)

Mutual labels: speech, speech-recognition, speech-to-text

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+1269.66%)

Mutual labels: speech, speech-recognition, speech-to-text

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+101.12%)

Mutual labels: speech, speech-recognition, speech-to-text

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+450.56%)

Mutual labels: speech, speech-recognition, speech-to-text

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+43.82%)

Mutual labels: speech, speech-recognition, speech-to-text

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-76.4%)

Mutual labels: captions, speech-recognition, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (+85.39%)

Mutual labels: speech, speech-recognition, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+12429.21%)

Mutual labels: speech, speech-recognition, speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+171.91%)

Mutual labels: speech, speech-recognition, speech-to-text

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-75.28%)

Mutual labels: speech-recognition, speech-to-text, stt

bingspeech-api-client

Microsoft Bing Speech API client in node.js

Stars: ✭ 32 (-64.04%)

Mutual labels: tts, speech-to-text, stt

kaldi ag training

Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-84.27%)

Mutual labels: speech, speech-recognition, speech-to-text

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (+15.73%)

Mutual labels: tts, speech-recognition, speech-to-text

speech to text

How to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-60.67%)

Mutual labels: speech, speech-recognition, speech-to-text

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-41.57%)

Mutual labels: speech, tts, speech-recognition

Syn Speech

Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (-35.96%)

Mutual labels: speech, speech-recognition, speech-to-text

revai-java-sdk

Rev.ai Java SDK

Stars: ✭ 16 (-82.02%)

Mutual labels: captions, speech-recognition, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-40.45%)

Mutual labels: tts, speech-recognition, speech-to-text

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Stars: ✭ 205 (+130.34%)

Mutual labels: speech, speech-recognition, speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (-60.67%)

Mutual labels: captions, speech-recognition, speech-to-text

Tensorflow Speech Recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Stars: ✭ 2,118 (+2279.78%)

Mutual labels: speech-recognition, speech-to-text, stt

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-71.91%)

Mutual labels: speech-recognition, speech-to-text, stt

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+6884.27%)

Mutual labels: speech, speech-recognition, speech-to-text

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+497.75%)

Mutual labels: speech, speech-recognition, speech-to-text

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+297.75%)

Mutual labels: speech-recognition, speech-to-text, stt

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-28.09%)

Mutual labels: speech, speech-to-text

Soloud

Free, easy, portable audio engine for games

Stars: ✭ 1,048 (+1077.53%)

Mutual labels: speech, speech-to-text

Tts

Tools to convert text to speech 📚💬

Stars: ✭ 84 (-5.62%)

Mutual labels: speech, tts

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (+11.24%)

Mutual labels: speech, speech-recognition

Gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API

Stars: ✭ 1,303 (+1364.04%)

Mutual labels: speech, tts

React.ai

Recognizes your speech, and a trained AI bot responds (e.g., customer service, personal assistant) using machine learning APIs (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, and Redux

Stars: ✭ 38 (-57.3%)

Mutual labels: speech-recognition, speech-to-text

Python Speech recognition

A simple example of using the Baidu speech recognition API with Python.

Stars: ✭ 106 (+19.1%)

Mutual labels: speech, speech-recognition

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (+1313.48%)

Mutual labels: speech, speech-recognition

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+1561.8%)

Mutual labels: speech, speech-recognition

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (+24.72%)

Mutual labels: speech, tts

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (+39.33%)

Mutual labels: speech, speech-recognition

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (+32.58%)

Mutual labels: speech, tts

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (+48.31%)

Mutual labels: speech, speech-recognition

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (+51.69%)

Mutual labels: speech, speech-recognition

Aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+2082.02%)

Mutual labels: speech, tts

Tts Papers

🐸 collection of TTS papers

Stars: ✭ 160 (+79.78%)

Mutual labels: speech, tts

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (-66.29%)

Mutual labels: speech-recognition, speech-to-text

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (+28.09%)

Mutual labels: speech, speech-recognition

Tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Stars: ✭ 1,756 (+1873.03%)

Mutual labels: speech, tts

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+2256.18%)

Mutual labels: speech, speech-recognition

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (-22.47%)

Mutual labels: speech-recognition, speech-to-text

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+175.28%)

Mutual labels: speech, speech-to-text

XION-ChaseCam

This is a free-to-use HTML/JavaScript-based overlay for roleplay streamers. Basically it mimics the overlay of the AXON bodycam, but since most folks play in 3rd person, it's a ChaseCam. I've included a logo and the HTML file. The HTML file has the CSS, HTML, and JavaScript all in one file for ease of editing. Go to line 81 of the HTML file to c…

Stars: ✭ 27 (-69.66%)

Mutual labels: twitch, obs

1-60 of 964 similar projects
