Next:
General
Up:
The Production of Speech
Previous:
The Production of Speech
Contents
General
Introduction
Preface
Intended audience
Overview
Terms and Definitions
Acknowledgments
Disclaimer
Legal Aspects, Contracts
Copyrights, Intellectual Properties
Speaker and Producer
Client and Contractor
Copyright Holder and User
Data Protection
Third Party Distribution
ELDA
LDC
BAS
Sharing Model
Meta Data
Importance of Meta Data
Recording protocol
Minimal requirements
Technical recording conditions
Other useful parameters
Example: Verbmobil II
Speaker Profiles
Minimal requirements
Other useful parameters
Example: SmartKom
Comments
Speech Corpus Production
Corpus Specification
Speaker Profiles
Number of Speakers
Vocabulary
Domain
Task
Phonological Distribution
Speaking Style
Read Speech
Answering Speech
Command / Control Speech
Descriptive Speech
Non-prompted Speech
Spontaneous Speech
Neutral vs. Emotional
Recording Setup
Telephone Recording
On-site Recording
Field Recording
Wizard-of-Oz
Annotation
Technical Specifications
Sampling Rate
Sample Type and Width
Number of Channels, Interleave
File Formats
Corpus Structure
Structure
File Naming Conventions
Distribution Media
Release Plan / Validation Procedures
Meta Data
Documentation
Preparation of collection
Instructions and Prompting
Recording Techniques
Telephone Recordings
On-site Recordings
Field Recordings
Wizard-of-Oz Recordings
Questionnaires and Forms
Legal Aspects
Check Lists
Pre-test
Planning of Recruitment
Collection
Ongoing Documentation, Logging
Pre-Validation
Quality Control
Monitoring
Control of Recording Process
Security
Security against Theft
Security against Data Loss
Data Logistics
Storage
Data Pipelining
Recruitment
Basic Recruiting Techniques
Incentives
Post-processing
File Transfer
File Name Assignment
Editing
Filtering
Re-sampling
Format Conversion
Special Conversion for Annotation
Automatic Error Detection
Annotation
Types of Annotation
Data Model
Orthographic Transcription
General Rules for Transcription
Possible Transcript Items
Transcription Example
Transcription Method
Existing Transcription Formats
Transcription Tools
Tagging
Segmentation and Labeling
Segments vs. Points-in-Time
Manual Segmentation
Automatic and Semi-automatic Segmentation
Annotation Methods
Manual Annotation Tools
WWWTranscribe
Praat
Internal Validation
Pronunciation Dictionary
File Format
Pronunciation Encoding
Lexical Encoding
Additional Contents
Examples
Simple List - Verbmobil
Simple List - The HTK Standard
Enriched Dictionary - PHONOLEX
Documentation
Starting Document
The Core Documentation
Other Documents
Validation
In-house vs. External
When to validate
Pre-Validation
Release Validation
Final Validation
What to validate
Validation Reports
Example
Distribution
Media Production
Compression / Compatibility
Signal / Symbolic Data
Safety / Verify / Versions
Larger Edition vs. Burn-on-Demand
On-line Distribution
Examples
WebCommand
Corpus Specification of WebCommand
Meta Data of WebCommand
Recording Protocol
Speaker Profiles
Comments to WebCommand
WebCommand Documentation
SpeechDat II German
Corpus Specification
Meta Data of SpeechDat
Recording Protocol
Speaker Profiles
Comments to SpeechDat
Specification Documents
SmartKom
Corpus Specification
Transcription
Transcription Example
Meta Data
Recording Protocol
Speaker Profiles
SmartKom Recording Protocol
SmartKom Speaker Profile
Comments on SmartKom
Bibliography
Check Lists - Summary
Web References - Summary
BAS - Rules of Transcription
Aims and Objectives
Basic Transcription
Vowels
Vocalised r
Consonants
Reductions
Foreign Words
List of All Symbols
Accents
Morpheme Markers (+)
Compound Markers (#)
Function Word Markers (+)
BITS Projekt-Account 2004-06-01