A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the data and parse compound words.

Stars: ✭ 101 (-10.62%)

Mutual labels: corpus

ppt presenter

Convert ppt to video with audio track, using text to speech synthesis

Stars: ✭ 38 (-66.37%)

Mutual labels: tts

Fergun

An utility Discord bot written in C# using Discord.Net

Stars: ✭ 26 (-76.99%)

Mutual labels: tts

DANeS

DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)

Stars: ✭ 64 (-43.36%)

Mutual labels: corpus

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+81.42%)

Mutual labels: asr

rasr

The RWTH ASR Toolkit.

Stars: ✭ 43 (-61.95%)

Mutual labels: asr

View All Similar Projects ➔

Speech-Corpus-Collection

This repo is a collection of Speech Corpus for automatic speech recognition (ASR) and text-to-speech (TTS).

ASR Corpus

VCTK
Around 10.4GB. Alternative Host
LibriSpeech
Large-scale (1000 hours) corpus of read English speech.
TEDLIUM release 2
The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website. The authors have prepared and filtered these data in order to train acoustic models to participate to the International Workshop on Spoken Language Translation 2011 (the LIUM English/French SLT system reached the first rank in the SLT task).

TTS Corpus

CMU ARCTIC Databases
The databases consist of around 1150 utterances, including US English male (bdl) and female (slt) speakers, as well as other accented speakers.
The World English Bible
The World English Bible is a public domain update of the American Standard Version of 1901 into modern English. Its text and audio recordings are freely avaiable here. Unfortunately, however, each of the audio files matches a chapter, not a verse, so is too long in most cases. Kyubyong sliced them by verse manually. You can get them on his dropbox.
Nancy Corpus
The Nancy corpus from the 2011 Blizzard Challenge. The data is freely availiable for research use on the signing of a license.

General

The NSynth Dataset
NSynth is an audio dataset containing 305,979 musical notes, each with a unique pitch, timbre, and envelope. For 1,006 instruments from commercial sample libraries, we generated four second, monophonic 16kHz audio snippets, referred to as notes, by ranging over every pitch of a standard MIDI pian o (21-108) as well as five different velocities (25, 50, 75, 100, 127). The note was held for the first three seconds and allowed to decay for the final second.

Contact Me

Yunchao He
Weibo

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

candlewill / Speech-Corpus-Collection

Labels

Projects that are alternatives of or similar to Speech-Corpus-Collection

Speech-Corpus-Collection

ASR Corpus

TTS Corpus

General

Contact Me