Source code for https://speaker.app, a batteries-included, web-based, quasi-decentralized, WebRTC networking platform, with a primary focus on audio and screen-sharing, and a secondary focus on chat messages and peripheral features.

Stars: ✭ 26 (-74.26%)

Mutual labels: microphone

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-79.21%)

Mutual labels: transcription

glaemscribe

Glaemscribe, the tolkienian languages/writings transcription engine.

Stars: ✭ 29 (-71.29%)

Mutual labels: transcription

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-87.13%)

Mutual labels: transcription

cala

Cross-platform system interface for hardware IO

Stars: ✭ 46 (-54.46%)

Mutual labels: microphone

cmu-pronouncing-dictionary

The 134,000+ words and their pronunciations in the CMU pronouncing dictionary

Stars: ✭ 46 (-54.46%)

Mutual labels: transcription

figaro

Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵

Stars: ✭ 362 (+258.42%)

Mutual labels: microphone

speech-to-text

Python helper for Google and IBM Watson speech-to-text cloud APIs.

Stars: ✭ 14 (-86.14%)

Mutual labels: transcription

Echo

A simple iOS application redirects microphone input to the Bluetooth audio device

Stars: ✭ 33 (-67.33%)

Mutual labels: microphone

RecPlayer-iOS

A simple iOS application that records audio and plays it back. (+some animations)

Stars: ✭ 21 (-79.21%)

Mutual labels: microphone

Record-Audio-on-Windows

A Go program that uses winmm.dll to record audio to a WAV file.

Stars: ✭ 28 (-72.28%)

Mutual labels: microphone

disable-autogain-control-extension

A chrome extension which disables the automatic microphone gain control in the MediaStream Web API.

Stars: ✭ 133 (+31.68%)

Mutual labels: microphone

scription

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

Stars: ✭ 46 (-54.46%)

Mutual labels: transcription

ToxicEye

👽 Program for remote control of windows computers via telegram bot. Written in C#

Stars: ✭ 305 (+201.98%)

Mutual labels: microphone

izabela-desktop

A proof of concept text-to-speech application allowing global typing. Can be used over applications such as voice chats, games and much more.

Stars: ✭ 62 (-38.61%)

Mutual labels: microphone

FARED for Anomaly Detection

Official source code of "Fast Adaptive RNN Encoder-Decoder for Anomaly Detection in SMD Assembly Machine"

Stars: ✭ 14 (-86.14%)

Mutual labels: microphone

Braille-Translator

Translates standard alphabet based text to Grade 2 Braille and back.

Stars: ✭ 29 (-71.29%)

Mutual labels: transcription

music led strip control

Audio visualization for LED strips in real-time with web interface on a raspberry pi.

Stars: ✭ 224 (+121.78%)

Mutual labels: microphone

View All Similar Projects ➔

Real-time Transcription Playground

A real-time transcription project using React and a socketio python server. The goal of this project is to enable developers to create web demos and speech2text prototypes with just a few lines of code. Examples can be medical dictation apps, a note-taking CRM for entrepreneurs, etc.

Currently only supports real-time transcription using Google Cloud Speech

Demo

Real-time.transcription.demo.mp4

Installation

Python 3 instructions
yarn instructions

Google Speech API

The code assumes an environment variable GOOGLE_SERVICE_JSON_FILE that points to a valid GCP service account file.

If you need to get a service account:

Within your Google Cloud console, create or select a project
Enable the Cloud Speech API for that project
Create a service account
Download a private key as JSON

More info in Google Cloud's docs here and here.

Then, set the environment variable GOOGLE_SERVICE_JSON_FILE to the path of the JSON file containing your service account key, e.g. /users/sahar/documents/sample-project-3c1a5892b00e.json. Further details can be found in this Medium article.

Setup

Clone or fork this repository
Create a virtual environment in the root directory: python -m venv $ENV_NAME
Activate the virtual environment: source $ENV_NAME/bin/activate (for MacOS, Unix, or Linux users) or .\ENV_NAME\Scripts\activate (for Windows users)
Install requirements: pip install -r backend/requirements.txt
Set your environment variable GOOGLE_SERVICE_JSON_FILE to point to your file path
Run yarn install in the root directory
Run yarn start to start the frontend and start-backend to run the backend

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

saharmor / realtime-transcription-playground

Programming Languages

Labels

Projects that are alternatives of or similar to realtime-transcription-playground

Real-time Transcription Playground

Demo

Installation

Google Speech API

Setup