• DocumentCode
    1134881
  • Title

    Robust AM-FM Features for Speech Recognition

  • Author

    Dimitriadis, Dimitrios ; Maragos, Petros ; Potamianos, Alexandros

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens, Greece
  • Volume
    12
  • Issue
    9
  • fYear
    2005
  • Firstpage
    621
  • Lastpage
    624
  • Abstract
    In this letter, a nonlinear AM-FM speech model is used to extract robust features for speech recognition. The proposed features measure the amount of amplitude and frequency modulation that exists in speech resonances and attempt to model aspects of the speech acoustic information that the commonly used linear source-filter model fails to capture. The robustness and discriminability of the AM-FM features is investigated in combination with mel cepstrum coefficients (MFCCs). It is shown that these hybrid features perform well in the presence of noise, both in terms of phoneme-discrimination (J-measure) and in terms of speech recognition performance in several different tasks. Average relative error rate reduction up to 11% for clean and 46% for mismatched noisy conditions is achieved when AM-FM features are combined with MFCCs.
  • Keywords
    amplitude modulation; feature extraction; frequency modulation; signal denoising; speech recognition; J-measure; MFCC; amplitude modulation; feature extraction; frequency modulation; mel cepstrum coefficient; nonlinear AM-FM speech model; phoneme-discrimination; speech acoustic information; speech recognition; Acoustic measurements; Acoustic noise; Cepstrum; Data mining; Feature extraction; Frequency measurement; Frequency modulation; Noise robustness; Resonance; Speech recognition; AM-FM; ASR; features; nonlinear; speech;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2005.853050
  • Filename
    1495427