BAS
Bavarian Archive for Speech Signals
Regional Variants of German 1 - RVG1

Last update 2014-03-04 - gleiche Seite in deutsch

The corpus consists of single digits, connected digits, phone numbers, phonetically balanced sentences, computer command phrases and spontaneous speech. Each speaker (498) has read a subcorpus of 85 items selected pseudo-randomly from several text corpora. The speaker was placed in front of a standard IBM-compatible PC in normal different office environments. Four different microphones (high quality to low cost) are used in parallel. Speakers were selected to achieve the demoscopic density of the German spoken areas in Europe (including Austria and Switzerland).

Vocabulary size read speech: 2080
Vocabulary size spontaneous speech: 8285

Online Corpus Documentation

Audio files

Speaker 254, Region F (Alemannisch), phone number
Talk Back Microphone (AT&T) fünf zwei drei eins zwei sieben drei
Telex Microphone (Soundblaster) fünf zwei drei eins zwei sieben drei
Sennheiser HD 410 fünf zwei drei eins zwei sieben drei
Sennheiser MD 441 fünf zwei drei eins zwei sieben drei

Revalidation report

Availability and Costs

Licensed.

Regional Variants of German 1 - RVG1
498 speakers (low quality mics), 421 speakers (high quality mics), 19.3GByte
32 CDROM Iso 9660 + shipping + handling EUR 8180.67 (ELRA Members 50% Discount)
Scientific License EUR 3579.04
Commercial License EUR 8691.96 (ELRA Members EUR 7669.38)

Remark:
The RVG1 corpus is also distributed on DVD-R or in subsets of single microphone channels. This reduces the amount of data or volumes and consequently the CDROM production fee. However, the license fee remains the same for all parts of the corpus.

Questions and orders to:


Florian Schiel