DocumentCode :
2697217
Title :
The 2005 AFRL/HEC One-Speaker Detection Systems
Author :
Slyh, Raymond E. ; Hansen, Eric G. ; Ore, Brian M.
Author_Institution :
Human Effectiveness Directorate, Air Force Res. Lab., Wright-Patterson AFB, OH
fYear :
2006
fDate :
28-30 June 2006
Firstpage :
1
Lastpage :
8
Abstract :
This paper describes the one-speaker detection systems submitted by AFRL/HEC for several of the training and testing conditions in the 2005 NIST speaker recognition evaluation. For each condition, the overall system score was the weighted combination of scores from several component systems. The component systems were based on (1) mel-frequency cepstral coefficients (MFCCs) and Gaussian mixture models (GMMs); (2) MFCCs and phoneme-specific GMMs (PS-GMMs); (3) linear-prediction-based cepstral coefficients (LPCCs) from closed-phase analysis; (4) formant center frequencies, formant bandwidths, and fundamental frequency (FMBWF0); and (5) word language modeling (WLM). The score combination was done using single-layer perceptrons, with the grouping of the component systems depending on the lengths of the training and testing files. For some of the testing and/or training conditions involving ten-second speech files, the system performance improved from the inclusion of the FMBWFO and LPCC systems, while the MFCC/PS-GMM system provided additional benefits in the one-conversation testing conditions involving larger amounts of training data
Keywords :
Gaussian processes; cepstral analysis; speaker recognition; AFRL-HEC system; LPCC; MFCC-PS-GMM system; NIST speaker recognition evaluation; WLM; closed-phase analysis; linear-prediction-based cepstral coefficient; mel-frequency cepstral coefficient; one-speaker detection system; phoneme-specific Gaussian mixture model; training data; word language modeling; Bandwidth; Cepstral analysis; Laboratories; Mel frequency cepstral coefficient; NIST; Natural languages; Speaker recognition; Speech recognition; System performance; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
Type :
conf
DOI :
10.1109/ODYSSEY.2006.248119
Filename :
4013536
Link To Document :
بازگشت