DocumentCode :
2404374
Title :
HTIMIT and LLHDB: speech corpora for the study of handset transducer effects
Author :
Reynolds, Douglas A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Volume :
2
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
1535
Abstract :
This paper describes two corpora collected at Lincoln Laboratory for the study of handset transducer effects on the speech signal: the handset TIMIT (HTIMIT) corpus and the Lincoln Laboratory Handset Database (LLHDB). The goal of these corpora are to minimize all confounding factors and produce speech predominately differing only in handset transducer effects. The speech is recorded directly from a telephone unit in a sound-booth using prompted text and extemporaneous photograph descriptions. The two corpora allow comparison of speech collected from a person speaking into a handset (LLHDB) versus speech played through a loudspeaker into a handset (HTIMIT). A comparison of analysis and results between the two corpora addresses the realism of artificially creating handset degraded speech by playing recorded speech through the handsets. The corpora are designed primarily for speaker recognition experimentation (in terms of amount of speech and level of transcription), but since both speaker and speech recognition systems operate on the same acoustic features affected by the handset, the knowledge gleaned is directly transferable to speech recognizers. Initial speaker identification performance on these corpora are presented. In addition, the application of HTIMIT in developing a handset detector that was successfully used on a Switchboard speaker verification task is described
Keywords :
acoustic signal processing; acoustic transducers; loudspeakers; speech processing; speech recognition; telephone sets; HTIMIT; LLHD; Lincoln Laboratory; Lincoln Laboratory Handset Database; Switchboard speaker verification task; acoustic features; handset TIMIT corpus; handset degraded speech; handset detector; handset transducer effects; loudspeaker; sound booth; speaker identification performance; speaker recognition experimentation; speaker recognition systems; speech corpora; speech recognition systems; speech recording; speech signal; telephone unit; transcription level; Acoustic transducers; Databases; Degradation; Laboratories; Loudspeakers; Speaker recognition; Speech analysis; Speech recognition; Telephone sets; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.596243
Filename :
596243
Link To Document :
بازگشت