DocumentCode
2697217
Title
The 2005 AFRL/HEC One-Speaker Detection Systems
Author
Slyh, Raymond E. ; Hansen, Eric G. ; Ore, Brian M.
Author_Institution
Human Effectiveness Directorate, Air Force Res. Lab., Wright-Patterson AFB, OH
fYear
2006
fDate
28-30 June 2006
Firstpage
1
Lastpage
8
Abstract
This paper describes the one-speaker detection systems submitted by AFRL/HEC for several of the training and testing conditions in the 2005 NIST speaker recognition evaluation. For each condition, the overall system score was the weighted combination of scores from several component systems. The component systems were based on (1) mel-frequency cepstral coefficients (MFCCs) and Gaussian mixture models (GMMs); (2) MFCCs and phoneme-specific GMMs (PS-GMMs); (3) linear-prediction-based cepstral coefficients (LPCCs) from closed-phase analysis; (4) formant center frequencies, formant bandwidths, and fundamental frequency (FMBWF0); and (5) word language modeling (WLM). The score combination was done using single-layer perceptrons, with the grouping of the component systems depending on the lengths of the training and testing files. For some of the testing and/or training conditions involving ten-second speech files, system performance improved with the inclusion of the FMBWF0 and LPCC systems, while the MFCC/PS-GMM system provided additional benefits in the one-conversation testing conditions involving larger amounts of training data.
Keywords
Gaussian processes; cepstral analysis; speaker recognition; AFRL-HEC system; LPCC; MFCC-PS-GMM system; NIST speaker recognition evaluation; WLM; closed-phase analysis; linear-prediction-based cepstral coefficient; mel-frequency cepstral coefficient; one-speaker detection system; phoneme-specific Gaussian mixture model; training data; word language modeling; Bandwidth; Cepstral analysis; Laboratories; Mel frequency cepstral coefficient; NIST; Natural languages; Speaker recognition; Speech recognition; System performance; System testing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location
San Juan
Print_ISBN
1-4244-0471-1
Electronic_ISBN
1-4244-0472-X
Type
conf
DOI
10.1109/ODYSSEY.2006.248119
Filename
4013536