Description

The SIGNUM Database contains both isolated and continuous utterances by various signers. Because we use a vision-based approach to sign language recognition, the corpus was recorded on video. For quick random access to individual frames, each video clip is stored as a sequence of images.

The vocabulary comprises 450 basic signs in German Sign Language (DGS) representing different word types. Based on this vocabulary, a total of 780 sentences were constructed. Each sentence is between two and eleven signs long. No intentional pauses are placed between signs within a sentence, but the sentences themselves are separated. The entire corpus, i.e. all 450 basic signs and all 780 sentences, was performed once by each of the 25 native signers, who differ in sex and age. One of them was chosen as the so-called reference signer; his performances were recorded three times rather than once. For more information, see the concept of the corpus.
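
As a quick consistency check, the composition described above determines the total number of recorded sequences. The following is a minimal Python sketch, not part of the database itself: the constant and function names are illustrative, and it assumes that every performance covers all 450 signs and all 780 sentences, as stated above.

```python
# Corpus composition as described in the text above.
NUM_SIGNS = 450               # isolated basic signs
NUM_SENTENCES = 780           # continuous sentences
NUM_SIGNERS = 25              # native signers, including the reference signer
REFERENCE_PERFORMANCES = 3    # the reference signer was recorded three times
OTHER_PERFORMANCES = 1        # every other signer was recorded once


def total_sequences() -> int:
    """Return the number of recorded sequences implied by the figures above."""
    sequences_per_performance = NUM_SIGNS + NUM_SENTENCES
    performances = REFERENCE_PERFORMANCES + (NUM_SIGNERS - 1) * OTHER_PERFORMANCES
    return sequences_per_performance * performances


print(total_sequences())  # 33210
```

That is, 1,230 sequences per performance times 27 performances gives 33,210 sequences.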

The following table summarizes the most important details about the recorded sign language database.

  General Information  
    Name: SIGNUM Database
    Author: Ulrich von Agris
  Corpus Content  
    Language: German Sign Language (DGS)
    Vocabulary size: 450 basic signs
    Number of signers: 25 native signers
    Number of isolated signs: 450
    Number of continuous sentences: 780
    Number of performances per signer:
      • Reference signer: 3
      • Other signers: 1
    Total number of sequences: 33,210
    Total number of images: 5,970,450
    Equivalent video duration: 55.3 h
  Technical Details  
    Image resolution: 776 x 578 pixels, 30 fps, 24 bpp, color
    Image format: JPEG
    Data volume: 920 GB (approx.)
    Medium: 1 hard disk
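
The equivalent video duration in the table follows from the total number of images and the frame rate. The short Python sketch below reproduces that figure and also derives a rough average size per JPEG image. The names are illustrative, and the average size rests on the assumption that the approximate data volume is given in decimal gigabytes (10^9 bytes), which the table does not state.

```python
# Figures taken from the table above.
TOTAL_IMAGES = 5_970_450
FRAME_RATE_FPS = 30
DATA_VOLUME_GB = 920  # approximate; assumed here to mean 10**9 bytes per GB


def video_hours(num_images: int, fps: int) -> float:
    """Duration in hours of num_images frames played back at fps."""
    return num_images / fps / 3600


def average_image_kb(volume_gb: float, num_images: int) -> float:
    """Average size per image in kilobytes (1 kB = 1000 bytes)."""
    return volume_gb * 1e9 / num_images / 1e3


print(round(video_hours(TOTAL_IMAGES, FRAME_RATE_FPS), 1))  # 55.3
print(round(average_image_kb(DATA_VOLUME_GB, TOTAL_IMAGES)))  # 154
```

Under these assumptions, each stored image takes roughly 154 kB on average.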