The VERBMOBIL CDROMs

 Last update of this page: 2014-03-27
 
 

Detailed Information about the Verbmobil Corpus CDROMs

Overview and Order Information

CD VM 1.0.3 (16.12.93)
    496608 KB 63 Dialogues 209 Appointm. 1840 Turns
    History:
    1.0 : only signal files, cut in turns, with push button
    1.0.1 : Update : 6 missing turns in dialog N019K completed
    1.0.2 : Update : Filenames in dialog N016K corrected (wrong turn numbering), 5 missing turns in dialog N010K completed
    1.0.3 : Update : new edition of all signal files of Karlsruhe
    1.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 2.0 (17.05.94)
    399828 KB 81 Dialogues 227 Appointm. 1538 Turns
    History:
    2.0 : only signal files, cut in turns, with push button
    2.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 3.0 (02.11.94)
    284888 KB 45 Dialogues 184 Appointm. 1214 Turns
    History:
    3.0 : only signal files, cut in turns, with push button
    3.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 4.0 (13.04.95)
    390384 KB 72 Dialogues 181 Appointm. 1517 Turns
    History:
    4.0 : only signal files, cut in turns, with push button
    4.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 5.0 (01.06.95)
    624290 KB 101 Dialogues 256 Appointm. 2154 Turns
    History:
    5.0 : only signal files, cut in turns, with push button
    5.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 6.0 (15.07.95)
    576758 KB 147 Dialogues (125 amerikanisch, 22 'denglisch') 191 Appointm. 1828 Turns
    History:
    6.0 : only signal files, cut in turns, with push button

CD VM 7.0 (15.10.95)
    532480 KB 68 Dialogues 238 Appointm. 1739 Turns
    History:
    7.0 : only signal files, cut in turns, with push button
    7.0.1 : Update: some signal files from Bonn had no PhonDat 1 Header
    7.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 8.0 (30.08.95)
    483000 KB English 252 Dialogues 252 Appointm.
    History:
    8.0 : only signal files, cut in turns, with push button
    8.1 : signal files, transliterations, BAS edition
    8.1.1 : extended by 89 appointments

CD VM 12.0 (28.02.96)
    598016 KB 207 Dialogues 207 Appointm. 2154 Turns
    History:
    12.0 : only signal files, cut in turns, with push button
    12.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 13.0 (11.07.96)
    549219 KB 200 Dialogues (54 'denglisch', 146 amerikanisch) 200 Appointm. 1714 Turns
    History:
    13.0 : only signal files, cut in turns, with push button

CD VM 14.0 (01.10.96)
    541529 KB 156 Dialogues 156 Appointm. 1891 Turns
    History:
    14.0 : signal files, cut in turns, with push button

CD VM S 1.0 (01.03.94)
    580000 KB 26 Dialogues - 2227 Turns
    History:
    S 1.0 : free dialogs (Stereo Files STF) without button push
    S 1.1 : transliterations, BAS edition

CD VM 15.0 (19.02.98)
    652718 KB 57 Dialogues (19 close microphone, 19 room microphone, 19 telephone)
    3117 Turns (1039 close, 1039 room, 1039 telephon) German, Scenario a.

CD VM 16.0 (12.12.98) (Original Edition VMJP1_4-CD1.0)
    379453KB, 200 Dialogues, 3311 Turns
    Attention: Transliteration is not correct VM1 or VM2 convention

CD VM 17.0 (12.12.98) (Original Edition VMJP2_4-CD1.0)
    338349KB, 200 Dialogues, 2741 Turns
    Attention: Transliteration is not correct VM1 or VM2 convention

CD VM 18.0 (23.12.98) (Original Edition VMJP3_4-CD1.0)
    253809KB, 200 Dialogues, 2345 Turns
    Attention: Transliteration is not correct VM1 or VM2 convention

CD VM 19.0 (23.12.98) (Original Edition VMJP4_4-CD1.0)
    387793KB, 200 Dialogues, 2911 Turns
    Attention: Transliteration is not correct VM1 or VM2 convention

CD VM 20.0 (23.04.98)
    584723KB 48 Dialogues (10 close microphone, 28 room microphone, 10 telephone)
    1947 Turns (398 close, 1151 room, 398 telephone) German, 3x10 Dialogues Scenario a, 11 Dialogues Scenario b, 4 Files with
    backgroundnoise from a business fair.

CD VM 21.0 (02.07.98)
    550018KB 62 Dialogues (38 close microphone, 2 room microphone, 22 telephone)
    2331 Turns (1527 close, 90 room, 714 telephone) German Scenario "A".

CD VM 22.0 (28.08.98)
    439939KB  60 Dialogues (German Scenario "A": 28 close microphone, 27 telephone; German Scenario "B": 5 close microphone).
    2004 Turns (915 close, 216 room, 873 telephone).

CD VM 23.0 (04.09.98)
    648555KB, 28 Dialogues (all close microphone)
    2459 Turns (all close) English Scenario "A"

CD VM 24.0 (12.11.98)
    511689KB, 58 Dialogues (36 close microphone, 22 mobile telephone)
    2231 Turns (1454 close, 777 mobile telephone) German scenarios "A" (54 dialogues) and "B" (4 dialogues).

CD VM 25.0 (08.12.98)
    470275KB, 10 Dialogues (all close microphone)
    1654 Turns (all close) Japanese Scenario "A".

CD VM 26.0 (08.12.98)
    524650KB, 16 Dialogues (all close microphone)
    1319 Turns (all close) Japanese Scenario "A".

CD VM 27.0 (08.12.98)
    565104KB, 24 Dialogues (all close microphone)
    1149 Turns (all close) Japanese Scenario "A".

CD VM 28.0   (14.02.99)
    573042KB, 28 Dialoques (all close microphone)
    2409 Turns (all close) Pittsburgh Scenario "A".
    update CD VM 23.0.1 (05.09.2000): 2727 re-segmented turns

CD VM 30.0   (09.04.99)
    637465KB, 58 Dialoques (33 close microphone, 25 mobile telephone)
    3024 Turns (1718 close, 1306 telephone), German scenarios "A" (52 dialoques) und "B" (6 dialoques)
 

Presently the following volumes are available on CDROM at University of Munich:

CD VM 1.0.3 (16.12.93)
    496608 KB 63 Dialogues 209 Appointm. 1840 Turns
    History:
    1.0 : only signal files, cut in turns, with push button
    1.0.1 : Update : 6 missing turns in dialog N019K completed
    1.0.2 : Update : Filenames in dialog N016K corrected (wrong turn numbering), 5 missing turns in dialog N010K completed
    1.0.3 : Update : new edition of all signal files of Karlsruhe
    1.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 2.0 (17.05.94)
    399828 KB 81 Dialogues 227 Appointm. 1538 Turns
    History:
    2.0 : only signal files, cut in turns, with push button
    2.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 3.0 (02.11.94)
    284888 KB 45 Dialogues 184 Appointm. 1214 Turns
    History:
    3.0 : only signal files, cut in turns, with push button
    3.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 4.0 (13.04.95)
    390384 KB 72 Dialogues 181 Appointm. 1517 Turns
    History:
    4.0 : only signal files, cut in turns, with push button
    4.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 5.0 (01.06.95)
    624290 KB 101 Dialogues 256 Appointm. 2154 Turns
    History:
    5.0 : only signal files, cut in turns, with push button
    5.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 6.0 (15.07.95)
    576758 KB English 147 Dialogues 191 Appointm. 1828 Turns
    History:
    6.0 : only signal files, cut in turns, with push button

CD VM 7.0 (15.10.95)
    532480 KB 68 Dialogues 238 Appointm. 1739 Turns
    History:
    7.0 : only signal files, cut in turns, with push button
    7.0.1 : Update: some signal files from Bonn had no PhonDat 1 Header
    7.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 8.0 (30.08.95)
    483000 KB English 252 Dialogues 252 Appointm.
    History:
    8.0 : only signal files, cut in turns, with push button
    8.1 : signal files, transliterations, BAS edition
    8.1.1 : extended by 89 appointments

CD VM 12.0 (28.02.96)
    598016 KB 207 Dialogues 207 Appointm. 2154 Turns
    History:
    12.0 : only signal files, cut in turns, with push button
    12.1 : signal files with PhonDat 2 header (orthography + canonical transcript), transliterations, BAS edition

CD VM 13.0 (11.07.96)
    549219 KB 200 Dialogues (54 'denglisch', 146 amerikanisch) 200 Appointm. 1714 Turns
    History:
    13.0 : only signal files, cut in turns, with push button

CD VM 14.0 (01.10.96)
    541529 KB 156 Dialogues 156 Appointm. 1891 Turns
    History:
    14.0 : signal files, cut in turns, with push button

CD VM S 1.0 (01.03.94)
    580000 KB 26 Dialogues - 2227 Turns
    History:
    S 1.0 : free dialogs (Stereo Files STF) without button push
    S 1.1 : transliterations, BAS edition

CD VM 15.0 (19.02.98)
    652718 KB 57 Dialogues (19 close microphone, 19 room microphone, 19 telephone)
    3117 Turns (1039 close, 1039 room, 1039 telephon) German, Scenario a.

CD VM 20.0 (23.04.98)
    584723KB 48 Dialogues (10 close microphone, 28 room microphone, 10 telephone)
    1947 Turns (398 close, 1151 room, 398 telephon) German, 3x10 Dialogues Scenario a, 11 Dialogues Scenario b, 4 Files with
    backgroundnoise from a business fair.

CD VM 21.0 (02.07.98)
    550018KB 62 Dialogues (38 close microphone, 2 room microphone, 22 telephone)
    2331 Turns (1527close, 90 room, 714 telephone) German Scenario "A".

CD VM 22.0 (28.08.98)
    439939KB  60 Dialogues (German Scenario "A": 28 close microphone, 27 telephone; German Scenario "B": 5 close microphone).
    2004 Turns (915 close, 216 room, 873 telephone).

CD VM 23.0 (04.09.98)
    648555KB, 28 Dialogues (all close microphone)
    2459 Turns (all close) English Scenario "A"

CD VM 24.0 (12.11.98)
    511689KB, 58 Dialogues (36 close microphone, 22 mobile telephone)
    2231 Turns (1454 close, 777 mobile telephone) German scenarios "A" (54 dialogues) and "B" (4 dialogues).

CD VM 25.0 (08.12.98)
    470275KB, 10 Dialogues (all close microphone)
    1654 Turns (all close) Japanese Scenario "A".

CD VM 26.0 (08.12.98)
    524650KB, 16 Dialogues (all close microphone)
    1319 Turns (all close) Japanese Scenario "A".

CD VM 27.0 (08.12.98)
    565104KB, 24 Dialogues (all close microphone)
    1149 Turns (all close) Japanese Scenario "A".

CD VM 28.0   (14.0299)
    573042KB, 28 Dialogues (all close microphone)
    2409 Turns (all close) Pittsburgh Scenario "A".

CD VM 29.0 (19.07.99)
    389287KB, 25 Dialoques (25 close microphone, 20 mobile telephone)
    1870 Turns (1026 close, 844 telephone), Munich scenario "A" (21 Dialogues) , 2 Bonn scenario "A", 2 Bonn scenario "B"

CD VM 30.0   (09.04.99)
    637465KB, 58 Dialogues (33 close microphone, 25 mobile telephone)
    3024 Turns (1718 close, 1306 telephone), German scenarios "A" (52 dialoques) and "B" (6 dialoques)

CD VM 31.0   (16.06.99)
    606693KB,  32 Dialogues (all close microphone)
    2512 Turns (all close), Pittsburgh Scenario "A" (32 dialogues)

CD VM 32.0   (24.06.99)
    601748KB, 17 multilingual WOZ-Dialogues English/German (all close microphone)
    992 Turns (all close), Hamburg scenario "A" (7 dialogues) and "B" (10 dialogues)

CD VM 33.0   (19.07.99)
    554279KB, 25 Dialogues (all close microphone)
    1050 Turns (all close), Kyoto/ Tokyo scenarios "A"

CD VM 34.0   (19.07.99)
    544947KB, 28 Dialogues (all close microphone)
    1437 Turns (all close), Kyoto/ Tokyo scenarios "A"

CD VM 35.0   (19.07.99)
    609479KB, 27 Dialogues (all close microphone)
    1645 Turns (all close), Kyoto/ Tokyo scenarios "A"

CD VM 36.0   (23.07.99)
    483199KB, 46 Dialogues (room microphone)
    1523 Turns (all room), Munich scenario "A"

CD VM 37.0   (23.07.99)
    490771KB, 34 Dialogues (room microphone)
    1521 Turns (all room), Munich scenario "A"

CD VM 38.0   (15.09.99)
    649693KB, 33 Dialogues (33 close microphone, 28 mobile telephone)
    3483 Turns (1886 close, 1597 telephone), Munich scenario "A"

CD VM 39.0   (15.09.99)
    585812KB, 2475 Turns (1483 close, 992 telephone)
    31 Dialogues, multilingual german-english, end-2-end-evaluation 12/98
    20 Dialogues, Munich scenario "A" (close microphone, mobile telephone)
    8 Dialogues, Bonn scenario "B" (close microphone)

CD VM 40.0   (15.09.99)
    437346KB, 33 Dialogues (room microphone)
    1378 Turns (all room), Munich scenario "A"

CD VM 41.0   (15.09.99)
    562141KB, 32 Dialogues (room microphone)
    1977 Turns (all room), Munich scenario "A"

CD VM 42.0   (17.09.99)
    442470KB, 20 Dialogues (close microphone)
    1874 Turns (all close), Pittsburgh scenario "A"

CD VM 43.0   (17.09.99)
    254922KB, 11 Dialogues (close microphone)
    633 Turns (all close), Pittsburgh scenario "A"

CD VM 44.0   (13.12.99)
    383833KB, 19 Dialogues (close microphone)
    920 Turns (all close), Tokyo/ Kyoto scenario "A"

CD VM 45.0   (13.12.99)
    419471KB, 21 Dialogues (close micophone)
    1293 Turns (all close), Tokyo/ Kyoto scenario "A"

CD VM 46.0 (21.09.2000)
    591279KB, 11 multilingual Dialogues  japanese/ german
    all close microphone, except Dialoques 234-238, channel 4, recorded with room microphone
    607 Turns,  9 Hamburg scenario "A", 2 Hamburg scenario "B"

CD VM 47.0 (21.06.2000)
    552449KB,  multilingual WOZ-Dialogues english/ german (close microphone)
    902 Turns (all close), Hamburg scenario "A"

CD VM 48.0   (05.01.2000)
    569647KB,  28 Dialogues (28 close microphone, 27 telephone microphone)
    2996 Turns (1520 close, 1476 telephone), Munich scenario "A"

CD VM 49.0   (10.01.2000)
    396515KB, 24 Dialogues
    1917 Turns (1237 close, 680 telephone), 12 Munich scenario "A" (close/ telephone microphone), 12 Bonn scenario "B" (close mirophone)

CD VM 50.0   (10.01.2000)
    155216KB, 8 Dialogues (close microphone)
    679 Turns (all close), Pittsburgh scenario "A"

CD VM 51.0 (03.07.2000)
    569647KB,  15 Dialogues english/ german (close microphone)
    873 Turns (all close), 11 Hamburg scenario "A", 4 Hamburg scenario "B"

CD VM 52.0   (10.08.2000)
    494372KB,  13 Dialogues englisch/ german (close microphone)
    728 Turns (all close), 1 Hamburg scenario "A", 12 Hamburg scenario "B"

CD VM 53.0   (10.08.2000)
     14859KB,  16 Dialogues german/ german
     1771 Turns (all close), 8 Bonn scenario "B" (close microphone),  8 Munich scenario "A" (close/ telephone/ room microphone)

CD VM 54.0 (10.08.2000)
    572253KB, room microphone Dialogues of CD48 and CD49

CD VM 55.0 (29.08.2000)
    349584KB, 11 Dialogues englisch/ german (close microphone)
    518 Turns (all close), 7 Hamburg scenario "A", 4 Hamburg scenario "B"

CD VM 56.0 (05.09.2000)
    363621KB, 12 Dialogues englisch/ german (close microphone)
    620 Turns (all close), 7 Hamburg scenario "A", 5 Hamburg scenario "B"

CD VM 57.0 (21.09.2000)
     631317KB,  11 multilingual Dialogues  japanese/ german (close microphone)
     702 Turns,  8 Hamburg scenario "A", 3 Hamburg scenario "B"

CD VM 58.0 (21.09.2000)
    400781KB,  7 multilingual Dialogues  japanese/ german (close microphone)
    421 Turns,  4 Hamburg scenario "A", 3 Hamburg scenario "B"

CD VM 59.0 (21.09.2000)
    366378KB, 7 multilingual Dialogues  japanese/ german (close microphone)
    354 Turns, 7 Hamburg scenario "B"

CD VM 60.0 (15.09.2000)
   The dialogues on this CD were intended to be used as a test set for the
   final evaluation of the japanese speech recogniser in the Verbmobil project.

CD VM 61.0 (15.09.2000)
    372601KB, 19 Dialogues japanese/ japanese (all close)
    946 Turns, main scenario

CD VM 62.0 (15.09.2000)
    455845 KB, 20 Dialoques japanese/ japanese (all close)
    981 Turns, main scenario

CD VM 63.0 (21.09.2000)
    620066KB, data with invoked emotions, collected in Erlangen

CD VM 64.0 (21.09.2000)
    615348KB, data with invoked emotions, collected in Erlangen

CD VM 65.0 (21.09.2000)
    637937KB, data with invoked emotions, collected in Erlangen
 

Each volume is stored on a CDROM ISO 9660 (High Sierra File System) which can be read on all platforms.

The handbook of data collection and transliteration in TP14 of VERBMOBIL was produced by IPDS Kiel (VERBMOBIL techdok-11-94.ps).
A new version of the handbook for VERBMOBIL II can be found in the Transliterationslexikon VERBMOBIL II.

 Backround information about the recordings can be found in the Verbmobil speaker database.

Software to read/write PhonDat format, to transform PhonDat in NIST or vice versa, to create PhonDat headers from rawfiles, to listen to PhonDat files are stored on each CDROM. You can retrieve the latest version of the software under the following address:

 host: ftp.phonetik.uni-muenchen.de
user: anonymous
dir: /pub/software/phondat

The volumes are blocked for free distribution for one year after edition. Only VERBMOBIL partners have access to the data within the first year. After that period they are transfered to the ELRA (European Language Resources Agency) and can be ordered from the Bavarian Archive for Speech Signals (BAS) in Munich.

For orders and questions by VERBMOBIL partners please contact

For orders and questions by others please contact


The VERBMOBIL Project


Florian Schiel