

Summary

This document is the result of a study conducted within the German BITS project in 2002. BITS is an acronym for "BAS Infrastructures for Technical Speech Processing"; it is a fully publicly funded project devoted to improving the infrastructure for Spoken Language Processing (SLP) of the German language. One of the BITS sub-projects aims to produce a cookbook-like document on the topic of Speech Corpora Validation.

A Speech Corpus, in the scope of this document, is a collection of digital speech recordings created with the aim of exploring how speech communication works, often with respect to technical applications such as Automatic Speech Recognition (ASR), Speech Synthesis or Speaker Verification.

The term Validation refers to a process that analyses and documents a speech corpus with regard to its specifications. The corpus may be either completed or still in production.

Speech Corpus Validation has several important applications in the field of Spoken Language Processing (SLP):

This document is a cookbook for speech corpus validation. It is the result of the validation experience gained at the Bavarian Archive for Speech Signals (BAS) in numerous corpus collections.


Angela Baumann 2004-06-03