WWWTranscribe is a tool for the annotation of audio signals via
the WWW. It features an oscillogram display of the speech signal,
audio output, editing buttons that simplify the task of annotating the
signal, and a formal consistency checker for the
annotations. WWWTranscribe was developed at the Bavarian Archive for
Speech Signals (BAS)8.20 within the
SpeechDat project. Currently8.21, it supports orthographic transcriptions
according to the SpeechDat guidelines; other annotation systems can be
added simply by extending the annotation object class hierarchy.
WWWTranscribe is implemented in Java using only the standard JDK classes to guarantee
platform independence.
In WWWTranscribe, the transcriber logs in and enters the ID of the session to be transcribed. A session consists of a number of recordings, each containing a single utterance corresponding to a prompt in the interview. Once a recording is selected, the transcription page is displayed. It contains a single output button with a speaker icon, a signal display, transcription and comment text fields, an assessment menu, and save and clear buttons (see figure
WWWTranscribe performs an automatic consistency check on the annotation text so that only formally valid annotations are entered into the annotation database.
At the BAS WWWTranscribe has been successfully used for a wide range of transcription, tagging, validation and evaluation tasks. WWWTranscribe is currently being packaged for public distribution8.22.