next up previous contents
Next: About this document ... Up: The Validation of Speech Previous: WebCommand - Main Documentation   Contents


WebCommand - Validation Report

Summary
The speech corpus WebCommand has been validated against the specified checks as given in the validation contract (see annex) as well as against general principles of good practice. The validation covered completeness, formal checks and manual checks of selected subsamples. The overall quality of the corpus is good and there should be no problem in using the corpus for the intended and other applications. Some flaws in the corpus documentation may be corrected without much effort.

Introduction

This document summarizes the results of an inhouse validation of the speech corpus WebCommand12.1. WebCommand was produced by the Bavarian Archive for Speech Signals (BAS) in the year 2002 as a contractor to Siemens AG, Munich. The aim of the corpus was to record application-specific commands in British English and French by native speakers in a quiet office environment. The aimed application is the control of a so called WebPad (a laptop without keyboard) used for surfing the internet and some other proprietary services. The spoken texts were prompted on screen and recorded with two different microphones and in two different rooms. The data were transcribed using SpeechDat conventions. Also a canonical pronunciation dictionary with all spoken words was included in the corpus.

Validation Results

The following list contains all validation steps as specified in the validation contract12.2 together with the methodology and the results.

Validation Tools

Sox was used to check the format of the signal files as well as for clippings.

WWWTranscribe12.3 was used to manually check the transcripts and the lexicon.

Other Observations

None.

Comments

The documentation lacks some details, which should be provided by the producer:

Result

The corpus WebCommand is in a usable status.


next up previous contents
Next: About this document ... Up: The Validation of Speech Previous: WebCommand - Main Documentation   Contents
Angela Baumann 2004-06-03