Item |
Remarks |
Example |
Lexical unit |
Standardized spelling and character coding. Define what counts as a lexical unit: words, interjections, neologisms? Lexical units are usually not tagged. |
station |
Spelling |
Spelling of a word or abbreviation letter by letter. |
$U $S $A |
Acronyms |
Official substitutes for words or phrases, pronounced as a word rather than letter by letter |
OPEC |
Proper names |
All names that cannot be translated into another language: people's names, street names, restaurant names, etc. |
~Peter ~Marine+World |
Numbers |
Numerals, combinations of numbers, and ordinal numbers. All numbers are written as words. |
#three #twenty #hundred |
Neologism |
Word that has been made up by the speaker |
*deliverator |
Foreign Words |
Words that are from another language and have not been officially adopted by the main language |
*IT saluti |
Off-Talk |
The speaker is talking to himself or herself rather than to the dialogue partner(s) |
what OOT do OOT I OOT> |
Read Off-Talk |
Off-talk caused by reading aloud |
seven ROT> |
Command Words |
Words to operate a dialogue system |
!KEYComputer |
Lengthening |
Markup of sounds within an item that are lengthened |
so L rry |
Garbage |
Words that are completely or partly incomprehensible |
% three% |
Truncation |
Item is truncated, for any of several reasons (technical problems, stuttering, etc.) |
so the que= by hel= *T> |
Interruption |
Items may be interrupted for several reasons: pauses, breathing, hesitations etc. |
trans_ A _lation |
Missing signal |
Missing parts of the signal for technical reasons have to be marked in the transcript. |
see [*] |
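
To illustrate how such markers could be processed automatically, the following is a minimal sketch in Python. It assumes that transcript tokens are separated by whitespace and covers only four markers taken from the Example column: the prefixes `$` (spelling), `~` (proper name), `#` (number) and the suffix `=` (truncation). The function names, the category labels, and the reading of `+` as a word joiner inside proper names are assumptions made for this example and are not part of the conventions. All other markers in the table are deliberately left out, so tokens carrying them fall through to the default category.

```python
# Hypothetical mapping from a one-character prefix marker to a category label.
# The markers come from the Example column; the labels are invented here.
PREFIX_CATEGORIES = {
    "$": "spelling",
    "~": "proper_name",
    "#": "number",
}


def classify(token):
    """Return a coarse category label for one whitespace-separated token."""
    # A trailing "=" marks a truncated item, as in "que=".
    if len(token) > 1 and token.endswith("="):
        return "truncation"
    # A leading marker followed by at least one character.
    if len(token) > 1 and token[0] in PREFIX_CATEGORIES:
        return PREFIX_CATEGORIES[token[0]]
    # Everything else is treated as an untagged lexical unit.
    return "lexical_unit"


def plain_text(token):
    """Strip the handled markers and return the bare word(s)."""
    category = classify(token)
    if category == "truncation":
        return token[:-1]
    if category == "proper_name":
        # Assumption: "+" joins the parts of a multi-word name.
        return token[1:].replace("+", " ")
    if category in ("spelling", "number"):
        return token[1:]
    return token


def tag_line(line):
    """Split a transcript line and pair each token with its category."""
    return [(token, classify(token)) for token in line.split()]
```

For example, `tag_line("~Peter ~Marine+World")` labels both tokens as `proper_name`, and `plain_text("~Marine+World")` yields `Marine World`. A full implementation would also need the bracketed tags for off-talk, lengthening, garbage and interruptions, which this sketch does not attempt.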