• DocumentCode
    2697217
  • Title

    The 2005 AFRL/HEC One-Speaker Detection Systems

  • Author

    Slyh, Raymond E. ; Hansen, Eric G. ; Ore, Brian M.

  • Author_Institution
    Human Effectiveness Directorate, Air Force Res. Lab., Wright-Patterson AFB, OH
  • fYear
    2006
  • fDate
    28-30 June 2006
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper describes the one-speaker detection systems submitted by AFRL/HEC for several of the training and testing conditions in the 2005 NIST speaker recognition evaluation. For each condition, the overall system score was the weighted combination of scores from several component systems. The component systems were based on (1) mel-frequency cepstral coefficients (MFCCs) and Gaussian mixture models (GMMs); (2) MFCCs and phoneme-specific GMMs (PS-GMMs); (3) linear-prediction-based cepstral coefficients (LPCCs) from closed-phase analysis; (4) formant center frequencies, formant bandwidths, and fundamental frequency (FMBWF0); and (5) word language modeling (WLM). The score combination was done using single-layer perceptrons, with the grouping of the component systems depending on the lengths of the training and testing files. For some of the testing and/or training conditions involving ten-second speech files, the system performance improved from the inclusion of the FMBWFO and LPCC systems, while the MFCC/PS-GMM system provided additional benefits in the one-conversation testing conditions involving larger amounts of training data
  • Keywords
    Gaussian processes; cepstral analysis; speaker recognition; AFRL-HEC system; LPCC; MFCC-PS-GMM system; NIST speaker recognition evaluation; WLM; closed-phase analysis; linear-prediction-based cepstral coefficient; mel-frequency cepstral coefficient; one-speaker detection system; phoneme-specific Gaussian mixture model; training data; word language modeling; Bandwidth; Cepstral analysis; Laboratories; Mel frequency cepstral coefficient; NIST; Natural languages; Speaker recognition; Speech recognition; System performance; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
  • Conference_Location
    San Juan
  • Print_ISBN
    1-424400471-1
  • Electronic_ISBN
    1-4244-0472-X
  • Type

    conf

  • DOI
    10.1109/ODYSSEY.2006.248119
  • Filename
    4013536