• DocumentCode
    968921
  • Title

    Dynamic formant tracking of noisy speech using temporal analysis on outputs from a nonlinear cochlear model

  • Author

    Deng, Li ; Kheirallah, Issam

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
  • Volume
    40
  • Issue
    5
  • fYear
    1993
  • fDate
    5/1/1993 12:00:00 AM
  • Firstpage
    456
  • Lastpage
    467
  • Abstract
    The authors take a modeling approach to studying representation of formant frequencies of spoken speech and speech in noise in the temporal responses of the peripheral auditory system. On the basis of the properties of the representation, they have devised and evaluated a cross-channel correlation algorithm and an interpeak interval analysis for automatic formant extraction of speech which is strongly dynamic in acoustic characteristics and is embedded in noise. The basilar membrane model used in this study contains laterally coupled damping elements, which are made monotonically dependent on the spatial distribution of the short-term power in the outputs of the model. Efficient digital implementation and the related salient numerical properties of the model are described. Simulation results from the model in response to speech and speech in noise illustrate temporal response patterns that are tonotopically organized in relation to speech formant parameters with little influence by the noise level. By utilizing such relations the devised cross-channel correlation algorithm is shown to be capable of accurately tracking formant movements in spoken syllables and sentences.
  • Keywords
    ear; physiological models; speech analysis and processing; cross-channel correlation algorithm; dynamic formant tracking; efficient digital implementation; formant frequencies; interpeak interval analysis; laterally coupled damping elements; monotonic dependence; noisy speech; nonlinear cochlear model; salient numerical properties; sentences; spoken syllables; temporal analysis; Acoustic noise; Algorithm design and analysis; Auditory system; Biomembranes; Frequency; Noise level; Nonlinear dynamical systems; Power system modeling; Speech analysis; Speech enhancement; Algorithms; Cochlear Duct; Evaluation Studies as Topic; Fourier Analysis; Humans; Models, Statistical; Noise; Signal Processing, Computer-Assisted; Sound Spectrography; Speech Acoustics; Speech Perception; Time Factors;
  • fLanguage
    English
  • Journal_Title
    Biomedical Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9294
  • Type

    jour

  • DOI
    10.1109/10.243416
  • Filename
    243416