مرکز منطقه ای اطلاع رساني علوم و فناوري - HTIMIT and LLHDB: speech corpora for the study of handset transducer effects

DocumentCode :

2404374

Title :

HTIMIT and LLHDB: speech corpora for the study of handset transducer effects

Author :

Reynolds, Douglas A.

Author_Institution :

Lincoln Lab., MIT, Lexington, MA, USA

Volume :

fYear :

1997

fDate :

21-24 Apr 1997

Firstpage :

1535

Abstract :

This paper describes two corpora collected at Lincoln Laboratory for the study of handset transducer effects on the speech signal: the handset TIMIT (HTIMIT) corpus and the Lincoln Laboratory Handset Database (LLHDB). The goal of these corpora are to minimize all confounding factors and produce speech predominately differing only in handset transducer effects. The speech is recorded directly from a telephone unit in a sound-booth using prompted text and extemporaneous photograph descriptions. The two corpora allow comparison of speech collected from a person speaking into a handset (LLHDB) versus speech played through a loudspeaker into a handset (HTIMIT). A comparison of analysis and results between the two corpora addresses the realism of artificially creating handset degraded speech by playing recorded speech through the handsets. The corpora are designed primarily for speaker recognition experimentation (in terms of amount of speech and level of transcription), but since both speaker and speech recognition systems operate on the same acoustic features affected by the handset, the knowledge gleaned is directly transferable to speech recognizers. Initial speaker identification performance on these corpora are presented. In addition, the application of HTIMIT in developing a handset detector that was successfully used on a Switchboard speaker verification task is described

Keywords :

acoustic signal processing; acoustic transducers; loudspeakers; speech processing; speech recognition; telephone sets; HTIMIT; LLHD; Lincoln Laboratory; Lincoln Laboratory Handset Database; Switchboard speaker verification task; acoustic features; handset TIMIT corpus; handset degraded speech; handset detector; handset transducer effects; loudspeaker; sound booth; speaker identification performance; speaker recognition experimentation; speaker recognition systems; speech corpora; speech recognition systems; speech recording; speech signal; telephone unit; transcription level; Acoustic transducers; Databases; Degradation; Laboratories; Loudspeakers; Speaker recognition; Speech analysis; Speech recognition; Telephone sets; Telephony;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on

Conference_Location :

Munich

ISSN :

1520-6149

Print_ISBN :

0-8186-7919-0

Type :

conf

DOI :

10.1109/ICASSP.1997.596243

Filename :

596243

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2404374