Title :
Speech recognition using an auditory model with pitch-synchronous analysis
Author :
Hunt, Melvyn J. ; Lefebvre, Claude
Author_Institution :
National Research Council of Canada, Ottawa, Canada
Abstract :
An auditory model with two-tone suppression has previously been shown to perform better in speech recognition experiments than a conventional filterbank representation, particularly with noisy or distorted speech. It was, however, known to have several defects including an uneven response across the spectrum and a tendency to detect harmonics of F0rather than F1. We show that instants of glottal excitation can be derived from the model even with noisy speech. By using this information to carry out pitch-synchronous analysis in a slightly modified model the problem of interaction with harmonics of F0can be solved. An analysis of the behavior of the model leads to a specification of a class of processes showing two-tone suppression and hence to a redesigned model avoiding the known defects. The pitch-synchronous analysis is then no longer necessary, but the robust indication of excitation points may have other uses. Spectrograms from the old and new models illustrate the improvements obtained.
Keywords :
Councils; Filter bank; Frequency; Harmonic analysis; Mechanical factors; Performance analysis; Power harmonic filters; Psychoacoustic models; Speech analysis; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
DOI :
10.1109/ICASSP.1987.1169585