saharmor / realtime-transcription-playground

License: MIT
A real-time transcription project using React and socketio

Programming Languages

JavaScript
Python
HTML
CSS

Projects that are alternatives of or similar to realtime-transcription-playground

Recording-Bot
A bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-78.22%)
Mutual labels:  transcription, google-speech-recognition
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-88.12%)
Mutual labels:  transcription
speaker.app
Source code for https://speaker.app, a batteries-included, web-based, quasi-decentralized, WebRTC networking platform, with a primary focus on audio and screen-sharing, and a secondary focus on chat messages and peripheral features.
Stars: ✭ 26 (-74.26%)
Mutual labels:  microphone
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-79.21%)
Mutual labels:  transcription
glaemscribe
Glaemscribe, the tolkienian languages/writings transcription engine.
Stars: ✭ 29 (-71.29%)
Mutual labels:  transcription
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-87.13%)
Mutual labels:  transcription
cala
Cross-platform system interface for hardware IO
Stars: ✭ 46 (-54.46%)
Mutual labels:  microphone
cmu-pronouncing-dictionary
The 134,000+ words and their pronunciations in the CMU pronouncing dictionary
Stars: ✭ 46 (-54.46%)
Mutual labels:  transcription
figaro
Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵
Stars: ✭ 362 (+258.42%)
Mutual labels:  microphone
speech-to-text
Python helper for Google and IBM Watson speech-to-text cloud APIs.
Stars: ✭ 14 (-86.14%)
Mutual labels:  transcription
Echo
A simple iOS application redirects microphone input to the Bluetooth audio device
Stars: ✭ 33 (-67.33%)
Mutual labels:  microphone
RecPlayer-iOS
A simple iOS application that records audio and plays it back. (+some animations)
Stars: ✭ 21 (-79.21%)
Mutual labels:  microphone
Record-Audio-on-Windows
A Go program that uses winmm.dll to record audio to a WAV file.
Stars: ✭ 28 (-72.28%)
Mutual labels:  microphone
disable-autogain-control-extension
A chrome extension which disables the automatic microphone gain control in the MediaStream Web API.
Stars: ✭ 133 (+31.68%)
Mutual labels:  microphone
scription
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
Stars: ✭ 46 (-54.46%)
Mutual labels:  transcription
ToxicEye
👽 Program for remote control of windows computers via telegram bot. Written in C#
Stars: ✭ 305 (+201.98%)
Mutual labels:  microphone
izabela-desktop
A proof of concept text-to-speech application allowing global typing. Can be used over applications such as voice chats, games and much more.
Stars: ✭ 62 (-38.61%)
Mutual labels:  microphone
FARED for Anomaly Detection
Official source code of "Fast Adaptive RNN Encoder-Decoder for Anomaly Detection in SMD Assembly Machine"
Stars: ✭ 14 (-86.14%)
Mutual labels:  microphone
Braille-Translator
Translates standard alphabet based text to Grade 2 Braille and back.
Stars: ✭ 29 (-71.29%)
Mutual labels:  transcription
music led strip control
Audio visualization for LED strips in real-time with web interface on a raspberry pi.
Stars: ✭ 224 (+121.78%)
Mutual labels:  microphone

Real-time Transcription Playground

A real-time transcription project using React and a Socket.IO Python server. The goal of this project is to let developers build web demos and speech-to-text prototypes with just a few lines of code. Examples include medical dictation apps and note-taking CRMs for entrepreneurs.

The project currently supports real-time transcription only through Google Cloud Speech.

Demo

Real-time.transcription.demo.mp4

Installation

Google Speech API

The code assumes an environment variable GOOGLE_SERVICE_JSON_FILE that points to a valid GCP service account file.

If you need to get a service account:

  • Within your Google Cloud console, create or select a project
  • Enable the Cloud Speech API for that project
  • Create a service account
  • Download a private key as JSON

More information is available in Google Cloud's documentation.

Then, set the environment variable GOOGLE_SERVICE_JSON_FILE to the path of the JSON file containing your service account key, e.g. /users/sahar/documents/sample-project-3c1a5892b00e.json. Further details can be found in this Medium article.
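
Before starting the backend, it can help to confirm that the variable is set and points to a plausible key file. The following is a minimal sketch of such a check, not code from this repository: the function name check_service_account_file and the REQUIRED_KEYS set are hypothetical, and the field list assumes the usual layout of a GCP service account key.

```python
import json
import os

# Assumption: fields normally present in a GCP service account key file.
REQUIRED_KEYS = {"type", "project_id", "private_key", "client_email"}


def check_service_account_file(env_var="GOOGLE_SERVICE_JSON_FILE"):
    """Return the key file path if env_var points to a plausible key file.

    Hypothetical helper for illustration; raises RuntimeError otherwise.
    """
    path = os.environ.get(env_var)
    if not path:
        raise RuntimeError(f"{env_var} is not set")
    if not os.path.isfile(path):
        raise RuntimeError(f"{env_var} points to a missing file: {path}")

    # The key file is plain JSON, so it can be inspected with the stdlib.
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)
    if not isinstance(data, dict):
        raise RuntimeError("key file does not contain a JSON object")

    missing = REQUIRED_KEYS - data.keys()
    if missing:
        raise RuntimeError(f"key file lacks fields: {sorted(missing)}")
    if data["type"] != "service_account":
        raise RuntimeError("key file is not a service account key")
    return path
```

Calling check_service_account_file() in the shell that will run the backend either returns the path or raises an error naming what is wrong. It only checks the file's shape and does not verify that the key is accepted by Google Cloud.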

Setup

  1. Clone or fork this repository
  2. Create a virtual environment in the root directory: python -m venv $ENV_NAME
  3. Activate the virtual environment: source $ENV_NAME/bin/activate (macOS, Linux, or other Unix systems) or .\ENV_NAME\Scripts\activate (Windows, replacing ENV_NAME with the name you chose)
  4. Install requirements: pip install -r backend/requirements.txt
  5. Set your environment variable GOOGLE_SERVICE_JSON_FILE to point to your file path
  6. Run yarn install in the root directory
  7. Run yarn start to start the frontend and yarn start-backend to start the backend