• DocumentCode
    388047
  • Title

    Speech recognition using an auditory model with pitch-synchronous analysis

  • Author

    Hunt, Melvyn J. ; Lefebvre, Claude

  • Author_Institution
    National Research Council of Canada, Ottawa, Canada
  • Volume
    12
  • fYear
    1987
  • fDate
    31868
  • Firstpage
    813
  • Lastpage
    816
  • Abstract
    An auditory model with two-tone suppression has previously been shown to perform better in speech recognition experiments than a conventional filterbank representation, particularly with noisy or distorted speech. It was, however, known to have several defects including an uneven response across the spectrum and a tendency to detect harmonics of F0 rather than F1. We show that instants of glottal excitation can be derived from the model even with noisy speech. By using this information to carry out pitch-synchronous analysis in a slightly modified model the problem of interaction with harmonics of F0 can be solved. An analysis of the behavior of the model leads to a specification of a class of processes showing two-tone suppression and hence to a redesigned model avoiding the known defects. The pitch-synchronous analysis is then no longer necessary, but the robust indication of excitation points may have other uses. Spectrograms from the old and new models illustrate the improvements obtained.
  • Keywords
    Councils; Filter bank; Frequency; Harmonic analysis; Mechanical factors; Performance analysis; Power harmonic filters; Psychoacoustic models; Speech analysis; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1987.1169585
  • Filename
    1169585