DocumentCode
3598082
Title
Perceptually-based features in ASR
Author
Mason, J.S.D. ; Gu, Y.
Author_Institution
Dept. of Electr. Eng., Univ. Coll. of Swansea, UK
fYear
1988
fDate
1/19/1988 12:00:00 AM
Firstpage
7/1
Lastpage
7/4
Abstract
Perceptually-based linear predictive (PLP) speech analysis, as proposed by Hermansky (1985), can have marked benefits in automatic speech recognition (ASR) systems. PLP analysis takes four psychoacoustic factors into account: critical-band resolution, the masking effect, equal loudness and the intensity-loudness law. This paper presents experimental results that illustrate the relative importance of each factor in the context of ASR. It is shown that the JSRU filter bank can be incorporated into the PLP process with very similar overall results. The ASR system is based on dynamic time warping (DTW), and the test vocabulary consists of the alphabet and the digits zero through nine.
Keywords
speech analysis and processing; speech recognition; automatic speech recognition; critical-band; dynamic time warping; equal-loudness; intensity-loudness law; linear predictive; masking effect; psychoacoustic factors; speech analysis
fLanguage
English
Publisher
iet
Conference_Titel
Speech Processing, IEE Colloquium on
Type
conf
Filename
208669