
evancohen / Sonus

Licence: mit
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Programming language: JavaScript

Projects that are alternatives of or similar to Sonus

KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-96.05%)
Mutual labels:  speech, voice-recognition, speech-recognition, speech-to-text, voice-control
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-90.04%)
Mutual labels:  voice-recognition, speech-recognition, speech-to-text, voice-control
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-90.04%)
Mutual labels:  speech, speech-recognition, speech-to-text, voice-control
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-23.68%)
Mutual labels:  voice-control, speech-recognition, speech-to-text, voice-recognition
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+58.08%)
Mutual labels:  voice-recognition, speech-recognition, speech-to-text
Mycroft Precise
A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-9.59%)
Mutual labels:  voice-control, speech-recognition, voice-recognition
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-94.92%)
Mutual labels:  voice-recognition, speech-recognition, speech-to-text
voce-browser
Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-93.61%)
Mutual labels:  voice-recognition, speech-recognition, speech-to-text
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-83.27%)
Mutual labels:  speech, speech-recognition, speech-to-text
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-84.59%)
Mutual labels:  speech, speech-recognition, speech-to-text
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-28.01%)
Mutual labels:  speech-recognition, speech-to-text, voice-recognition
Hey Athena Client
Your personal voice assistant
Stars: ✭ 336 (-36.84%)
Mutual labels:  alexa, voice-control, voice-recognition
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-97.37%)
Mutual labels:  speech, speech-recognition, speech-to-text
Speech To Text Benchmark
speech to text benchmark framework
Stars: ✭ 481 (-9.59%)
Mutual labels:  speech-recognition, speech-to-text, voice-recognition
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-90.6%)
Mutual labels:  voice-recognition, speech-recognition, speech-to-text
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-93.42%)
Mutual labels:  speech, speech-recognition, speech-to-text
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-76.88%)
Mutual labels:  speech, speech-recognition, speech-to-text
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-96.43%)
Mutual labels:  speech-recognition, speech-to-text, voice-control
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-26.13%)
Mutual labels:  speech-recognition, speech, speech-to-text
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-94.36%)
Mutual labels:  voice-recognition, speech-recognition, speech-to-text

sonus


A dead simple STT library in Node

Sonus lets you quickly and easily add a VUI (Voice User Interface) to any hardware or software project. Just like Alexa, Google Assistant, and Siri, Sonus is always listening offline for a customizable hotword. Once that hotword is detected, your speech is streamed to the cloud recognition service of your choice, and you get the results in real time.

Platform Support

  • [X] Linux - most major distros (Including Raspbian)
  • [X] macOS
  • [ ] Windows

Streaming Recognition Services

  • [X] Google Cloud Speech
  • [ ] Alexa Voice Services
  • [ ] Wit.ai
  • [ ] Microsoft Cognitive Services
  • [ ] Houndify

Installation

npm install --save sonus

Dependencies

Generally, running npm install should suffice. This module, however, also requires you to install SoX.

For most Linux distros

Recommended: use arecord, which comes with most Linux distros. Alternatively, install SoX:

sudo apt-get install sox libsox-fmt-all

For macOS

brew install sox
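
To confirm that a recorder is actually installed before starting Sonus, a small PATH lookup can help. The following is a minimal sketch, not part of Sonus: `findOnPath` is a hypothetical helper, and the candidate names are `arecord` plus `rec` and `sox`, the commands that SoX installs.

```javascript
const fs = require('fs')
const path = require('path')

// Hypothetical helper: returns the full path of `name` if an executable
// with that name exists in one of the PATH directories, otherwise null.
function findOnPath (name, envPath = process.env.PATH || '') {
  for (const dir of envPath.split(path.delimiter)) {
    if (!dir) continue
    const candidate = path.join(dir, name)
    try {
      fs.accessSync(candidate, fs.constants.X_OK)
      return candidate
    } catch (err) {
      // Not in this directory, keep looking.
    }
  }
  return null
}

// Report which recording programs are available on this machine.
const found = ['arecord', 'rec', 'sox'].filter((name) => findOnPath(name))
console.log(found.length
  ? `recorders available: ${found.join(', ')}`
  : 'no recorder found; install SoX or arecord')
```

The lookup only checks that a file is present and executable; it does not verify that a microphone is attached or that the program can open it.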

Usage

Configure your cloud speech recognition system of choice, such as the Google Cloud Speech API.

Note: You need to set the GOOGLE_APPLICATION_CREDENTIALS environment variable to the path of your JSON keyfile, or check the examples to see how you can pass in the keyfile path.

Add Sonus and the recognizer:

const Sonus = require('sonus')
const speech = require('@google-cloud/speech')
const client = new speech.SpeechClient()
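
If you would rather pass the keyfile path explicitly than rely on the environment variable, one possible approach is sketched below. It assumes the Google client accepts a `keyFilename` option in its constructor; `clientOptions` is a hypothetical helper, not part of Sonus or the Google library, and `./keyfile.json` is a placeholder path.

```javascript
const path = require('path')

// Hypothetical helper: builds constructor options for the Google client.
// An explicit keyfile path wins; otherwise the environment variable must be set.
function clientOptions (keyfilePath, env = process.env) {
  if (keyfilePath) {
    return { keyFilename: path.resolve(keyfilePath) }
  }
  if (env.GOOGLE_APPLICATION_CREDENTIALS) {
    // The client picks the keyfile up from the environment on its own.
    return {}
  }
  throw new Error('No credentials: set GOOGLE_APPLICATION_CREDENTIALS or pass a keyfile path')
}

console.log(clientOptions('./keyfile.json'))

// Possible use with the recognizer (not run here, requires the package):
// const client = new speech.SpeechClient(clientOptions('./keyfile.json'))
```

Failing early with a clear message tends to be easier to debug than an authentication error raised later, once audio is already streaming.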

Add your keyword and initialize Sonus with a Snowboy hotword:

const hotwords = [{ file: 'resources/snowboy.umdl', hotword: 'snowboy' }]
const sonus = Sonus.init({ hotwords }, client)
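
A missing or mistyped model path is an easy mistake to make here. As a rough sketch, the hotword list can be checked before it is handed to Sonus.init; `checkHotwords` is a hypothetical helper and only looks at the two fields used above, `file` and `hotword`.

```javascript
const fs = require('fs')

// Hypothetical helper: returns a list of human-readable problems found in a
// hotword configuration. An empty list means every entry looks usable.
// `exists` is injectable so the check can be exercised without real files.
function checkHotwords (hotwords, exists = fs.existsSync) {
  const problems = []
  hotwords.forEach((entry, i) => {
    if (typeof entry.hotword !== 'string' || entry.hotword === '') {
      problems.push(`hotwords[${i}]: missing hotword name`)
    }
    if (typeof entry.file !== 'string' || !exists(entry.file)) {
      problems.push(`hotwords[${i}]: model file not found: ${entry.file}`)
    }
  })
  return problems
}

// Same configuration as in the snippet above.
const candidateHotwords = [{ file: 'resources/snowboy.umdl', hotword: 'snowboy' }]
checkHotwords(candidateHotwords).forEach((problem) => console.error(problem))
```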

Create your own Alexa in less than a tweet:

Sonus.start(sonus)
sonus.on('hotword', (index, keyword) => console.log("!"))
sonus.on('final-result', console.log)
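
Building on the two events above, recognized phrases can be mapped to actions. This is a minimal sketch under two assumptions: that the 'final-result' payload is the transcript text, and that the Sonus instance behaves like a standard Node event emitter. `attachCommands` is a hypothetical helper, and a plain EventEmitter stands in for Sonus so the sketch runs without a microphone.

```javascript
const { EventEmitter } = require('events')

// Hypothetical helper: wires command handling onto any emitter that fires
// the 'hotword' and 'final-result' events.
function attachCommands (emitter, commands, log = console.log) {
  emitter.on('hotword', (index, keyword) => log(`heard hotword: ${keyword}`))
  emitter.on('final-result', (text) => {
    // Normalize the transcript so lookups ignore case and stray spaces.
    const phrase = String(text).trim().toLowerCase()
    const handler = commands[phrase]
    if (handler) handler()
    else log(`unrecognized: ${phrase}`)
  })
}

// Stand-in for the Sonus instance; events are emitted by hand.
const fake = new EventEmitter()
const seen = []
attachCommands(fake, { 'lights on': () => seen.push('lights on') }, (msg) => seen.push(msg))
fake.emit('hotword', 0, 'snowboy')
fake.emit('final-result', 'Lights on')
console.log(seen)
```

With a real setup, the `sonus` object created earlier would be passed in place of `fake`.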

Full API Documentation

Versioning

This project uses semantic versioning as of v0.1.0.

How do I set up Google Cloud Speech API?

Follow these instructions.

How do I make my own hotword?

Sonus uses Snowboy for offline hotword recognition. You can use their website or API to train a model for a new hotword. Hotword training must occur online through their web service.

Built #withsonus

If you've built a project with Sonus, send a PR and include it here!

Authors

Evan Cohen: @_evnc
Ashish Chandwani: @ashishschandwa1

License

Licensed under MIT.
