Idlak / Living Audio Dataset

Licence: apache-2.0
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone to be able to add to it.

The Living Audio Dataset

Language codes must be 2 letters, normally the language's ISO 639-1 code (see the Wikipedia article on ISO 639-1 codes).

Accent codes must also be 2 letters.

Speakers are identified by 3-letter codes. If a speaker was recorded in multiple languages, this should be noted in the speaker's README and, where possible, their speaker code should be the same in each language. It is highly recommended that the recording environment is noted in the README, including the microphone, location, and original sample rate. The audio should be saved as uncompressed .wav files and then zipped.
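The naming rules above can be checked mechanically. The following is a minimal sketch, not part of the dataset's own tooling: the function name `parse_speaker_dir` is hypothetical, and it assumes the three codes are joined with underscores in the `ln_ac_spk` form shown under "Speaker resources" and that letter case is not significant.

```python
import re
from typing import NamedTuple, Optional

# Assumed pattern: 2-letter language, 2-letter accent, 3-letter speaker,
# joined by underscores. Ignoring case is an assumption, not a documented rule.
_SPEAKER_DIR = re.compile(r"([A-Za-z]{2})_([A-Za-z]{2})_([A-Za-z]{3})")


class SpeakerId(NamedTuple):
    language: str
    accent: str
    speaker: str


def parse_speaker_dir(name: str) -> Optional[SpeakerId]:
    """Return the codes in a directory name, or None if it does not match."""
    match = _SPEAKER_DIR.fullmatch(name)
    if match is None:
        return None
    language, accent, speaker = match.groups()
    # Normalise to lower case so "ABW" and "abw" compare equal.
    return SpeakerId(language.lower(), accent.lower(), speaker.lower())
```

For example, `parse_speaker_dir("en_rp_rbu")` yields the codes `en`, `rp` and `rbu`, while a name with a 3-letter language code such as `eng_rp_rbu` is rejected.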

Directory structure

Maintaining the directory structure is very important for the tools in IDLAK.

Language resources

  • ln
    • text
      • source.xml
    • README.md
    • word_frequencies.xml
    • other resources in xml

"word_frequencies.xml" is an example, not a requirement.

See below for the text source format. Give each source file a sensible name. If a source applies to only one region or accent, we recommend that its file name start with the region and an underscore, for example "uk_bbc.xml". For this reason we recommend not using underscores elsewhere in file names. We also recommend not using spaces in file names.
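To illustrate why underscores are best reserved for the region prefix, here is a small sketch of how a tool might read the prefix back out of a file name. The helper `split_source_name` is hypothetical and is not taken from the IDLAK tools; it simply treats everything before the first underscore as the region.

```python
from pathlib import Path
from typing import Optional, Tuple


def split_source_name(filename: str) -> Tuple[Optional[str], str]:
    """Split a source file name into (region, name).

    The region is None when the name contains no underscore.
    """
    stem = Path(filename).stem  # drop the ".xml" extension
    region, separator, name = stem.partition("_")
    if not separator:
        return None, stem
    return region, name
```

With this reading, `uk_bbc.xml` splits into region `uk` and name `bbc`, whereas a file called `bbc_news.xml` would be misread as region `bbc`.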

Text source format

We recommend keeping files to a reasonable size (under 5000 lines).

The text source xml format is:

<?xml version="1.0" encoding="UTF-8"?>
<text_sources>
  <text_source id="unique within file" name="any name" url="original url">
    <notes>
      Notes here are ignored and this section is optional.
      Other than id, all attributes of the text_source tag are optional.
    </notes>
    <text>
      Text source here, utf8-encoded.
    </text>
  </text_source>
  <text_source />
  <text_source />
  <text_source />
</text_sources>
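A file in this format can be read with Python's standard library. The sketch below is illustrative only: `load_text_sources` and `SAMPLE` are hypothetical names, and skipping `text_source` elements that have no `id` (such as the empty placeholders in the example above) is an assumption rather than documented behaviour.

```python
import xml.etree.ElementTree as ET
from typing import Dict, Optional

SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<text_sources>
  <text_source id="a1" name="demo">
    <text>
      Hello world.
    </text>
  </text_source>
  <text_source />
</text_sources>"""


def load_text_sources(xml_data: bytes) -> Dict[str, Optional[str]]:
    """Map each text_source id to its stripped text (None if <text> is absent)."""
    root = ET.fromstring(xml_data)
    sources: Dict[str, Optional[str]] = {}
    for source in root.iter("text_source"):
        source_id = source.get("id")
        if source_id is None:
            continue  # assumption: elements without an id are placeholders
        if source_id in sources:
            # ids must be unique within a file
            raise ValueError(f"duplicate text_source id: {source_id!r}")
        text = source.findtext("text")
        sources[source_id] = text.strip() if text is not None else None
    return sources
```

Calling `load_text_sources(SAMPLE)` gives a dictionary with the single key `a1` mapped to the text `Hello world.`; the `notes` element is never read, matching the statement that notes are ignored.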

Speaker resources

  • ln_ac_spk
    • README.md
    • text.xml
    • lexicon.xml

Audio is uploaded to archive.org in tar.gz format.

The script can note the pronunciation of specific words by the speaker with <pron> tags.

The lexicon is optional and will be appended to the language lexicon in the IDLAK main repo.
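Because the tools depend on the directory layout, a contributor may want to check a speaker directory before submitting it. This is a rough sketch under stated assumptions: `missing_speaker_files` is a hypothetical helper, and treating `README.md` and `text.xml` as required (with only `lexicon.xml` optional) is inferred from the listing above rather than stated by the project.

```python
from pathlib import Path
from typing import List

# Assumption: these two files are required; lexicon.xml is optional.
REQUIRED_FILES = ("README.md", "text.xml")


def missing_speaker_files(speaker_dir: Path) -> List[str]:
    """Return the required file names that are absent from speaker_dir."""
    return [
        name for name in REQUIRED_FILES
        if not (speaker_dir / name).is_file()
    ]
```

An empty list means every file assumed to be required is present.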

Current speakers

| Speaker | Language | Accent | Gender | Total duration (mm:ss) | Sample rate (Hz) |
|---|---|---|---|---|---|
| ABW | Dutch (nl) | Netherlands (nl) | Man | 57:49 | 48,000 |
| RBU | English (en) | RP (rp) | Man | 50:50 | 48,000 |
| CLL | Irish (ga) | Non-native (ie) | Man | 61:56 | 48,000 |
| ABR | Russian (ru) | Russian (ru) | Woman | 34:58 | 48,000 |