• DocumentCode
    2999266
  • Title

    Perceptually based processing in automatic speech recognition

  • Author

    Hermansky, Hynek ; Tsuga, Kazuhiro ; Makino, Shozo ; Wakita, H.

  • Author_Institution
    Speech Technology Laboratory, Santa Barbara, California, U.S.A.
  • Volume
    11
  • fYear
    1986
  • fDate
    31503
  • Firstpage
    1971
  • Lastpage
    1974
  • Abstract
    The perceptually based linear predictive (PLP) speech analysis method is applied to isolated word automatic speech recognition (ASR). Low dimensionality of the PLP analysis vector, which is otherwise identical in form to the standard linear predictive (LP) analysis vector, allows for computational and storage savings in ASR. We show that in speaker-dependent recognition of the alpha-numeric vocabulary, the PLP method in VQ-based ASR yields recognition scores similar to those of the standard ASR system. The main focus of the paper is on cross-speaker ASR. We demonstrate in experiments with vowel centroids of two male speakers and one female speaker that PLP speech representation is more consistent with the underlying phonetic information than the standard LP method. Conclusions from the experiments are confirmed by superior performance of the PLP method in cross-speaker isolated word recognition.
  • Keywords
    Auditory system; Automatic speech recognition; Failure analysis; Humans; Isolation technology; Psychology; Speech analysis; Speech processing; Speech recognition; Vectors;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1986.1168649
  • Filename
    1168649