DocumentCode :
2999266
Title :
Perceptually based processing in automatic speech recognition
Author :
Hermansky, Hynelc ; Tsuga, Ihzuhiro ; Makino, Shozo ; Wakita, H.
Author_Institution :
Speech Technology Laboratory, Santa Barbara, California, U.S.A.
Volume :
11
fYear :
1986
fDate :
31503
Firstpage :
1971
Lastpage :
1974
Abstract :
The perceptually based linear predictive (PLP) speech analysis method is applied to isolated word automatic speech recognition (ASR). Low dimensionality of the PLP analysis vector, which is otherwise identical in form to the standard linear predictive (LP) analysis vector, allows for computational and storage savings in ASR. We show that in speaker-dependent recognition of the alpha-numeric vocabulary, the PLP method in VQ-based ASR yields similar recognition scores as does the standard ASR system. The main focus of the paper is on cross-speaker ASR. We demonstrate in experiments with vowel centroids of two male and one female speakers that PLP speech representation is more consistent with the underlying phonetic information than the standard LP method. Conclusions from the experiments are confirmed by superior performance of the PLP method in cross-speaker isolated word recognition.
Keywords :
Auditory system; Automatic speech recognition; Failure analysis; Humans; Isolation technology; Psychology; Speech analysis; Speech processing; Speech recognition; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type :
conf
DOI :
10.1109/ICASSP.1986.1168649
Filename :
1168649
Link To Document :
بازگشت