• DocumentCode
    3433467
  • Title

    Multi-band speech recognition in noisy environments

  • Author

    Okawa, Shigeki ; Bocchieri, Enrico ; Potamianos, Alexandros

  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    641
  • Abstract
    This paper presents a new approach to multi-band automatic speech recognition (ASR). Previous work by Bourlard et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, p.426-9, 1996) and Hermansky et al. (see Proc. Int. Conf. on Spoken Language Processing, Philadelphia, p.1579-82, 1996) suggests that multi-band ASR, which combines the likelihoods of different frequency bands, gives more accurate recognition, especially in noisy acoustic environments. Here we evaluate this likelihood recombination (LC) approach to multi-band ASR and propose an alternative method, feature recombination (FC). In the FC system, a separate acoustic analyzer is applied to each sub-band, and the resulting sub-band features are combined into a single vector. The speech classifier then calculates the likelihood from that single vector. Thus, band-limited noise affects only a few of the feature components, as in the multi-band LC system, but at the same time all feature components are jointly modeled, as in conventional ASR. The experimental results show that the FC system can yield better performance than both conventional ASR and the LC strategy for noisy speech.
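The contrast between LC and FC described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: it assumes two sub-bands with one feature each, Gaussian models, and hypothetical helper names (`lc_score`, `fc_score`). LC scores each band with its own model and sums weighted log-likelihoods; FC concatenates the sub-band features and scores the single vector with one joint model.

```python
import math

def log_gauss_1d(x, mean, var):
    """Log-density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def log_gauss_2d(x, mean, cov):
    """Log-density of a 2-D Gaussian with a full 2x2 covariance."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx0, dx1 = x[0] - mean[0], x[1] - mean[1]
    # Quadratic form dx^T * inverse(cov) * dx, using the closed-form 2x2 inverse.
    quad = (d * dx0 * dx0 - (b + c) * dx0 * dx1 + a * dx1 * dx1) / det
    return -0.5 * (2 * math.log(2 * math.pi) + math.log(det) + quad)

def lc_score(band_features, band_models, weights):
    """Likelihood recombination (illustrative): each band is scored by its own
    model, and the weighted log-likelihoods are summed."""
    return sum(
        w * log_gauss_1d(feat[0], mean, var)
        for feat, (mean, var), w in zip(band_features, band_models, weights)
    )

def fc_score(band_features, joint_mean, joint_cov):
    """Feature recombination (illustrative): sub-band features are concatenated
    into one vector, which a single joint model then scores."""
    vector = [value for feat in band_features for value in feat]
    return log_gauss_2d(vector, joint_mean, joint_cov)

# Hypothetical per-band features (one value per band) and model parameters.
features = [[0.3], [-1.2]]
lc = lc_score(features, [(0.0, 1.0), (0.0, 2.0)], [1.0, 1.0])
fc_independent = fc_score(features, [0.0, 0.0], [[1.0, 0.0], [0.0, 2.0]])
fc_correlated = fc_score(features, [0.0, 0.0], [[1.0, 0.5], [0.5, 2.0]])
```

With a diagonal joint covariance and unit weights the two scores coincide; they diverge once the joint model captures correlation between bands, which is the sense in which FC models all feature components jointly.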
  • Keywords
    acoustic analysis; acoustic noise; acoustic signal processing; feature extraction; pattern classification; speech processing; speech recognition; acoustic analyzers; band-limited noise; experimental results; feature recombination; frequency bands; likelihood recombination; multi-band automatic speech recognition; noisy acoustic environments; noisy speech; performance; speech classifier; sub-band features; Acoustic noise; Additive noise; Automatic speech recognition; Filter bank; Frequency; Noise robustness; Psychoacoustic models; Speech enhancement; Speech recognition; Working environment noise
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675346
  • Filename
    675346