Authors |
Florian Schiel
|
Affiliation |
BAS Bayerisches Archiv für Sprachsignale Institut für Phonetik Universität München |
Postal address |
Schellingstr. 3 D 80799 München |
E-Mail |
bas@phonetik.uni-muenchen.de |
Telephone |
+49-89-2180-2758 |
Fax |
+49-89-21805790 |
Corpus Version |
0.9 |
Date |
27/04/2009 |
Status |
finished |
Comment |
All minor recommendations listed in this validation report have been
amended by the authors in version 1.0 of the corpus. With regard to this validation procedure the corpus is now error-free. |
Validation Guidelines |
Florian Schiel: The Validation of Speech
Corpora, IPS LMU München, 2003,
www.bas.uni-muenchen.de/Forschung/BITS/TP2/Cookbook |
The following required contents of the documentation have been checked:
- Administrative Information:
Validating person: not given
Date of validation: not given
Contact for requests regarding the corpus: ok.
Number and type of media: 929GByte on hard disc. ok.
Content of each medium: n.a.
Copyright statement and intellectual property rights (IPR): ok.
- Technical information:
Layout of media:
Information about file system type and directory structure:
file system type not given (NTFS), dir structure ok.
File nomenclatura: Explanation of used codes (no white space
in file names!): ok.
Formats of signals and annotation files: If non standard
formats are used, it's common to give
a fully description or convert into standard format: JPEG picture. ok.
Coding: n.a.
Compression: Just widely supported compressions like zip or
gzip should be used: JPEG implies (lossy) compression,
grade of compression quality not given.
Sampling rate: 30fps. ok.
Valid bits per pixel: 24. ok.
Image size: 776x578. ok.
Multiplexed signals: n.a.
- Database contents:
Clearly stated purpose of the recordings:
sign language recognition; no specifics given. ok.
Speech type(s):
basic signs and sentences read from screen. ok.
Instruction to speakers in full copy:
ok.
- Linguistic contents of prompted speech:
Specifications of the individual text items:
detailed description of basic signs, compounds and sentences. ok.
Specification for the prompt sheet design or specification of the
design of the speech prompts:
screen-prompted, no specification of prompt design given.
Example prompt sheet or example sound file from the speech prompting:
n.a.
- Speaker information:
Speaker recruitment strategies: not given.
Number of speakers: 25 speakers. ok.
Distribution of signers over sex, age, dialect regions:
not given; only implicitely in the sgner data profiles.
Signer profiles contain sufficient information but are only accessible
in the HTML documentation. A simple ASCII table should be added to
the corpus, so that automatic processing is feasible.
Description/definition of dialect regions: probably n.a.
- Recording platform and recording conditions:
Recording platform: n.a.
Position and type of camera: ok.
Position of speakers: ok.
Number of channels and channel separation: n.a.
Acoustical environment: n.a.
- Annotation:
The mapping of file names and content can only be seen by help of the
search fuction within the HTML documentation. For automatic processing this
is not acceptable. The mapping is stored in JavaScript tables in German and
English: /html/sentences.js ...
A simple ASCII table should provided in the corpus as well.
Unambiguous spelling standard used in annotations:
"The annotation follows the specifications of the Aachener Glossenumschrift,
developed by the Deaf Sign Language Research Team (DESIRE) at the RWTH
Aachen University." It is unclear whether this standard is widely used.
Labeling symbols: n.a.
List of non-standard spellings: probably n.a.
Distinction of homographs which are no homophones: n.a.
Character set used in annotations: not given; probably 7-BIT ASCII/HTML
Any other language dependend information as abbreviations etc.:
ok.
Annotation manual, guidelines, instructions: n.a.
Description of quality assurance procedures: n.a.
Selection of annotators: n.a.
Training of annotators: n.a.
Annotation tools used: n.a.
- Lexicon: n.a.
- Statistical information:
Frequency of sub-word
units: phonemes: n.a.
Word frequency table: n.a.
Status documentation: repairable
play: mplayer "mf://*.jpg" -mf fps=30
encode to MPEG4: mencoder "mf://*.jpg" -mf fps=30 -o con0066.avi -ovc lavc -lavcopts vcodec=mpeg4