DocumentCode
388047
Title
Speech recognition using an auditory model with pitch-synchronous analysis
Author
Hunt, Melvyn J. ; Lefebvre, Claude
Author_Institution
National Research Council of Canada, Ottawa, Canada
Volume
12
fYear
1987
fDate
31868
Firstpage
813
Lastpage
816
Abstract
An auditory model with two-tone suppression has previously been shown to perform better in speech recognition experiments than a conventional filterbank representation, particularly with noisy or distorted speech. It was, however, known to have several defects, including an uneven response across the spectrum and a tendency to detect harmonics of F0 rather than F1. We show that instants of glottal excitation can be derived from the model even with noisy speech. By using this information to carry out pitch-synchronous analysis in a slightly modified model, the problem of interaction with harmonics of F0 can be solved. An analysis of the behavior of the model leads to a specification of a class of processes showing two-tone suppression and hence to a redesigned model avoiding the known defects. The pitch-synchronous analysis is then no longer necessary, but the robust indication of excitation points may have other uses. Spectrograms from the old and new models illustrate the improvements obtained.
Keywords
Councils; Filter bank; Frequency; Harmonic analysis; Mechanical factors; Performance analysis; Power harmonic filters; Psychoacoustic models; Speech analysis; Speech recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type
conf
DOI
10.1109/ICASSP.1987.1169585
Filename
1169585