Idlak / Living Audio Dataset

Licence: apache-2.0

A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone to be able to add to it.

Programming Languages

python

Labels

audio

The Living Audio Dataset

Language codes must be 2 letters, normally the language's 2-letter ISO code (see the Wikipedia article on ISO 639-1 codes).

Accent codes must also be 2 letters.

Speaker codes are 3 letters. If a speaker was recorded in multiple languages, this should be noted in the speaker's README and, where possible, their speaker code should be the same in each language. It is highly recommended that the README describes the recording environment, including the microphone, the location, and the original sample rate. The audio should be saved as uncompressed .wav files and then zipped.
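The code rules above can be checked mechanically. The following is a minimal sketch, not part of the IDLAK tools: the function name `check_codes` is hypothetical, and the letter case (lowercase language and accent codes, uppercase speaker codes) is an assumption taken from the "Current speakers" table, since the rules above only fix the lengths.

```python
import re

# The README only fixes the lengths (2, 2 and 3 letters). The letter case
# used here mirrors the "Current speakers" table and is an assumption.
LANGUAGE_RE = re.compile(r"[a-z]{2}")
ACCENT_RE = re.compile(r"[a-z]{2}")
SPEAKER_RE = re.compile(r"[A-Z]{3}")


def check_codes(language, accent, speaker):
    """Return a list of problems; an empty list means all three codes pass."""
    problems = []
    if not LANGUAGE_RE.fullmatch(language):
        problems.append(f"language code {language!r} is not 2 letters")
    if not ACCENT_RE.fullmatch(accent):
        problems.append(f"accent code {accent!r} is not 2 letters")
    if not SPEAKER_RE.fullmatch(speaker):
        problems.append(f"speaker code {speaker!r} is not 3 letters")
    return problems


print(check_codes("nl", "nl", "ABW"))   # no problems expected
print(check_codes("eng", "rp", "RBU"))  # language code is too long
```

This check does not verify that a language code is a real ISO 639-1 code, only that it has the right shape.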

Directory structure

Maintaining the directory structure is very important for the tools in IDLAK.

Language resources

ln
- text
  - source.xml
- README.md
- word_frequencies.xml
- other resources in xml

"word_frequencies.xml" is an example, not a requirement.

See below for the text source format. Source files should have sensible names. If a source really covers only one region or accent, we recommend starting the file name with the region and an underscore, for example "uk_bbc.xml". For this reason we recommend not using underscores anywhere else in your file names. We also recommend avoiding spaces in file names.
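As an illustration of why a single underscore matters, a tool could recover the region from a source file name like this. This is a sketch only: `split_source_name` is a hypothetical helper, not an IDLAK function, and it assumes that everything before the first underscore is the region.

```python
from pathlib import Path
from typing import Optional, Tuple


def split_source_name(file_name: str) -> Tuple[Optional[str], str]:
    """Split 'uk_bbc.xml' into ('uk', 'bbc').

    The region is None when the name has no underscore prefix.
    """
    stem = Path(file_name).stem
    region, separator, name = stem.partition("_")
    if not separator:
        return None, stem
    return region, name


print(split_source_name("uk_bbc.xml"))  # ('uk', 'bbc')
print(split_source_name("source.xml"))  # (None, 'source')
```

A name with additional underscores would still be split at the first one, so any extra underscore makes the region prefix ambiguous.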

Text source format

We recommend keeping files to a reasonable size (under 5000 lines).

The text source xml format is:

<?xml version="1.0" encoding="UTF-8"?>
<text_sources>
  <text_source id="unique within file" name="any name" url="original url">
    <notes>
      Notes here are ignored and this section is optional.
      other than id all attributes are optional in the text_source tag 
    </notes>
    <text>
      Text source here, utf8-encoded.
    </text>
  </text_source>
  <text_source />
  <text_source />
  <text_source />
</text_sources>
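A file in this format can be read with Python's standard library. The sketch below is an illustration under assumptions, not the loader used by IDLAK: `read_text_sources` and `SAMPLE` are hypothetical names, and the function treats a missing or repeated id as an error because id is the only required attribute and must be unique within the file.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the format described above.
SAMPLE = (
    b'<?xml version="1.0" encoding="UTF-8"?>'
    b"<text_sources>"
    b'<text_source id="a" name="any name" url="original url">'
    b"<notes>Ignored.</notes>"
    b"<text> Hello </text>"
    b"</text_source>"
    b'<text_source id="b"><text>World</text></text_source>'
    b"</text_sources>"
)


def read_text_sources(xml_data):
    """Map each text_source id to its text; <notes> sections are skipped."""
    root = ET.fromstring(xml_data)
    sources = {}
    for node in root.iter("text_source"):
        source_id = node.get("id")
        if source_id is None:
            raise ValueError("text_source without an id attribute")
        if source_id in sources:
            raise ValueError(f"duplicate text_source id: {source_id!r}")
        # A text_source without a <text> child yields an empty string.
        sources[source_id] = (node.findtext("text") or "").strip()
    return sources


print(read_text_sources(SAMPLE))  # {'a': 'Hello', 'b': 'World'}
```

Stripping surrounding whitespace from the text is a design choice made for this sketch; a real tool may need to preserve it.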

Speaker resources

ln_ac_spk
- README.md
- text.xml
- lexicon.xml

Audio is uploaded to archive.org in tar.gz format.

The script can note the pronunciation of specific words by the speaker with <pron> tags.

The lexicon is optional and will be appended to the language lexicon in the IDLAK main repo.
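The speaker layout above could be checked with a short script. This is a sketch under assumptions: the helper names are hypothetical, the directory name is assumed to follow the ln_ac_spk pattern with underscores as the only separators, and README.md and text.xml are assumed to be required because only the lexicon is described as optional.

```python
from pathlib import Path

REQUIRED_FILES = ("README.md", "text.xml")  # assumed to be required
OPTIONAL_FILES = ("lexicon.xml",)           # described as optional above


def parse_speaker_dir_name(name):
    """Split 'ln_ac_spk' into (language, accent, speaker)."""
    parts = name.split("_")
    if len(parts) != 3:
        raise ValueError(f"expected ln_ac_spk, got {name!r}")
    return tuple(parts)


def missing_speaker_files(speaker_dir):
    """Return the required files that are absent from a speaker directory."""
    speaker_dir = Path(speaker_dir)
    return [name for name in REQUIRED_FILES
            if not (speaker_dir / name).is_file()]


# "en_rp_RBU" is an illustrative name built from the speaker table; the
# letter case used for real directories is not stated in this README.
print(parse_speaker_dir_name("en_rp_RBU"))  # ('en', 'rp', 'RBU')
```

Calling `missing_speaker_files` on a directory returns an empty list when both assumed-required files are present.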

Current speakers

Speaker	Language	Accent	Gender	Total duration (mm:ss)	Sample rate (Hz)
ABW	Dutch (nl)	Netherlands (nl)	Man	57:49	48,000
RBU	English (en)	RP (rp)	Man	50:50	48,000
CLL	Irish (ga)	Non-native (ie)	Man	61:56	48,000
ABR	Russian (ru)	Russian (ru)	Woman	34:58	48,000
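The durations in the table are given as mm:ss, and the minutes field can exceed 60 (for example 61:56). A small sketch of how the total could be computed from the table values, using hypothetical names:

```python
def mmss_to_seconds(value):
    """Convert an 'mm:ss' string to seconds; minutes may exceed 59."""
    minutes, seconds = value.split(":")
    return int(minutes) * 60 + int(seconds)


# Durations copied from the "Current speakers" table.
DURATIONS = {"ABW": "57:49", "RBU": "50:50", "CLL": "61:56", "ABR": "34:58"}

total = sum(mmss_to_seconds(value) for value in DURATIONS.values())
print(f"{total // 60}:{total % 60:02d}")  # prints 205:33
```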
