| Item |
Remarks |
Example |
| Lexical unit |
Standardized spelling/character coding. Define a
lexical unit: words, interjections, neologisms? Lexical units are usually not tagged. |
station |
| Spelling |
Spelling of a word or abbreviation letter by letter. |
$U $S $A |
| Acronyms |
Official substitutes for words or phrases, spelled like a word |
OPEC |
| Proper names |
All names that cannot be translated into another language: People's names, street names, restaurants etc. |
~Peter ~Marine+World |
| Numbers |
Numerals, combinations of numbers and ordinal numbers. All number written as words |
#three #twenty #hundred |
| Neologism |
Word that has been made up by the speaker |
*deliverator |
| Foreign Words |
Words that are from another language and have not been officially adopted by the main language |
*IT saluti |
| Off-Talk |
person is speaking to himself or herself and not to the partner(s) of the dialogue |
what OOT do OOT I OOT![$>$]() |
| Read Off-Talk |
Off-talk caused by reading aloud |
seven ROT![$>$]() |
| Command Words |
Words to operate a dialogue system |
!KEYComputer |
| Lengthening |
Markup of sounds within an item that are lengthened |
so L rry |
| Garbage |
words completely or partly incomprehensible |
% three% |
| Truncation |
Item is truncated for several reasons (technical, stutter etc.) |
so the que= by hel= *T![$>$]() |
| Interruption |
Items may be interrupted for several reasons: pauses, breathing, hesitations etc. |
trans_ A _lation |
| Missing signal |
Missing parts of the signal for technical reasons have to be marked in the transcript. |
see ![[*]](crossref.png) |