next up previous contents
Next: Data Model Up: Annotation Previous: Annotation   Contents

Types of Annotation

The following list of annotations is taken from the documentation of the BAS Partitur Format8.2and will give you an idea of what different types of annotation might be used and what has already been done so far. Pure transcriptions or tagging are marked with an (T), while segmentations and labellings are marked with an (S): Note that a transcript contains no information about the time relation of its contents aside from the fact that usually a chunk of speech is associated to a chunk of transcript. For example, if the corpus is structured in paragraphs of read text, then each signal file stores the speech of one paragraph while the associated transcription file stores the transcript of what was said in the signal file, but there is no fine-grain time information about when each individual word starts and ends within the signal file.

A segmentation requires either

of the labelled category. For example, in a phonemic segmentation and labelling each segment will consist of the phoneme category (coded for instance in SAM-PA), the begin of the phonemic segment and the duration:
IPA:   1.2758934 0.097867  e:


next up previous contents
Next: Data Model Up: Annotation Previous: Annotation   Contents
BITS Projekt-Account 2004-06-01