DocumentCode :
3598082
Title :
Perceptually-based features in ASR
Author :
Mason, J.S.D. ; Gu, Y.
Author_Institution :
Dept. of Electr. Eng., Univ. Coll. of Swansea, UK
fYear :
1988
fDate :
1/19/1988 12:00:00 AM
Firstpage :
42552
Lastpage :
42555
Abstract :
Perceptually-based linear predictive (PLP) speech analysis, as proposed by Hermansky 1985, can have marked benefits in ASR (automatic speech recognition) systems. Four psychoacoustic factors are considered in PLP analysis, namely critical-band, masking effect, equal-loudness and intensity-loudness law. This paper presents experimental results aimed at illustrating the relative importance of each of these in the context of ASR. It is shown that the [J] SRU filter bank can be incorporated into the PLP process with very similar overall results. The ASR system is based on dynamic time warping (DTW), and a vocabulary consisting of the alphabet and zero-through-nine is used for tests
Keywords :
speech analysis and processing; speech recognition; automatic speech recognition; critical-band; dynamic time warping; equal-loudness; intensity-loudness law; linear predictive; masking effect; psychoacoustic factors; speech analysis;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Speech Processing, IEE Colloquium on
Type :
conf
Filename :
208669
Link To Document :
بازگشت