• DocumentCode
    977673
  • Title

    Source generator equalization and enhancement of spectral properties for robust speech recognition in noise and stress

  • Author

    Hansen, John H L ; Clements, Mark A.

  • Author_Institution
    Dept. of Electr. Eng., Duke Univ., Durham, NC, USA
  • Volume
    3
  • Issue
    5
  • fYear
    1995
  • fDate
    9/1/1995 12:00:00 AM
  • Firstpage
    407
  • Lastpage
    415
  • Abstract
    Studies have shown that depending on speaker task and environmental conditions, recognizers are sensitive to noisy stressful environments. The focus of the study is to achieve robust recognition in diverse environmental conditions through the formulation of feature enhancement and stress equalization algorithms under the framework of source generator theory. The generator framework is considered as a means of modeling production variation under stressful speaking conditions. A multi-dimensional stress equalization procedure is formulated that produces recognition features less sensitive to varying factors caused by stress. A feature enhancement algorithm is employed based on iterative techniques previously derived for enhancement of speech in varying background noise environments. Combined stress equalization and feature enhancement reduces average word error rates across 10 noisy stressful conditions by -38.7% (e.g., noisy loud, angry, and Lombard effect stress conditions, etc.). The results suggest that the combination of a flexible source generator framework to address stressed speaking conditions, and a feature enhancement algorithm that adapts based on speech-specific constraints, can be effective in reducing the consequences of stress and noise for robust automatic recognition
  • Keywords
    equalisers; error statistics; feature extraction; iterative methods; speech enhancement; speech recognition; enhancement; feature enhancement; iterative techniques; noisy stressful environments; production variation; robust automatic recognition; robust speech recognition; source generator equalization; spectral properties; stress equalization algorithms; word error rates; Automatic speech recognition; Background noise; Error analysis; Iterative algorithms; Noise generators; Noise reduction; Noise robustness; Speech enhancement; Stress; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.466655
  • Filename
    466655