• DocumentCode
    3598082
  • Title

    Perceptually-based features in ASR

  • Author

    Mason, J.S.D. ; Gu, Y.

  • Author_Institution
    Dept. of Electr. Eng., Univ. Coll. of Swansea, UK
  • fYear
    1988
  • fDate
    1/19/1988
  • Firstpage
    7/1
  • Lastpage
    7/4
  • Abstract
    Perceptually-based linear predictive (PLP) speech analysis, as proposed by Hermansky (1985), can have marked benefits in automatic speech recognition (ASR) systems. PLP analysis considers four psychoacoustic factors: critical-band, masking effect, equal-loudness, and the intensity-loudness law. This paper presents experimental results that illustrate the relative importance of each of these factors in the context of ASR. It is shown that the JSRU filter bank can be incorporated into the PLP process with very similar overall results. The ASR system is based on dynamic time warping (DTW), and the test vocabulary consists of the alphabet and the digits zero through nine.
  • Keywords
    speech analysis and processing; speech recognition; automatic speech recognition; critical-band; dynamic time warping; equal-loudness; intensity-loudness law; linear predictive; masking effect; psychoacoustic factors; speech analysis
  • fLanguage
    English
  • Publisher
    IET
  • Conference_Titel
    Speech Processing, IEE Colloquium on
  • Type

    conf

  • Filename
    208669