DocumentCode :
3077120
Title :
Pitch and spectral estimation of speech based on auditory synchrony model
Author :
Seneff, Stephanie
Author_Institution :
Massachusetts Institute of Technology, Cambridge, Massachusetts
Volume :
9
fYear :
1984
fDate :
30742
Firstpage :
45
Lastpage :
48
Abstract :
This paper describes a system for processing sonorant regions of speech, motivated by knowledge of the human auditory system. The spectral representation is intended to reflect a proposed model for human auditory processing of speech, which takes advantage of synchrony in the nerve firing patterns to enhance formant peaks. The auditory model is also applied to pitch extraction, and thus a temporal pitch processor is envisioned. The spectrum is derived from the outputs of a set of linear filters with critical bandwidths. Saturation and adaptation are incorporated for each filter independently. Each "spectral" coefficient is determined by weighting the amplitude response at that frequency by a measure of synchrony to the center frequency of the filter. Pitch is derived from a waveform generated by adding the rectified filter outputs across the frequency dimension. The spectral estimator and the pitch estimator are illustrated by processing pure tones and natural speech.
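The pitch path outlined in the abstract (filter bank, rectification, summation across frequency, then a temporal pitch estimate) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the paper's implementation: the two-pole resonators, the bandwidth rule `max(100, 0.2 * fc)`, the choice of half-wave rectification, and the autocorrelation peak picker are all stand-ins chosen for brevity, and the saturation and adaptation stages are omitted. All function names and parameter values are hypothetical.

```python
import math

def resonator(x, fc, bw, fs):
    """Two-pole resonator at fc (Hz) with bandwidth bw (Hz); a stand-in filter."""
    r = math.exp(-math.pi * bw / fs)              # pole radius from bandwidth
    a1 = 2.0 * r * math.cos(2.0 * math.pi * fc / fs)
    a2 = -r * r
    y1 = y2 = 0.0
    out = []
    for s in x:
        y = (1.0 - r) * s + a1 * y1 + a2 * y2     # rough gain scaling by (1 - r)
        out.append(y)
        y2, y1 = y1, y
    return out

def summed_rectified(x, centers, fs):
    """Half-wave rectify each channel and add the channels across frequency."""
    total = [0.0] * len(x)
    for fc in centers:
        bw = max(100.0, 0.2 * fc)                 # assumed critical-band approximation
        for i, v in enumerate(resonator(x, fc, bw, fs)):
            if v > 0.0:
                total[i] += v
    return total

def estimate_pitch(w, fs, fmin=80.0, fmax=400.0, skip=200, win=256):
    """Pick the autocorrelation peak over a fixed window (assumed pitch method)."""
    mean = sum(w) / len(w)
    z = [v - mean for v in w]
    lo, hi = int(fs / fmax), int(fs / fmin)       # lag range in samples
    best_lag, best = lo, -math.inf
    for lag in range(lo, hi + 1):
        r = sum(z[n] * z[n + lag] for n in range(skip, skip + win))
        if r > best:
            best_lag, best = lag, r
    return fs / best_lag

# Synthetic test input: 16 equal-amplitude harmonics of a 125 Hz fundamental.
fs = 8000
f0 = 125.0
x = [sum(math.sin(2.0 * math.pi * f0 * k * n / fs) for k in range(1, 17))
     for n in range(800)]
centers = [250.0, 500.0, 1000.0, 2000.0]          # hypothetical channel centers
w = summed_rectified(x, centers, fs)
pitch = estimate_pitch(w, fs)
print(round(pitch, 1))                            # should be close to 125 Hz
```

The `skip` samples are discarded so that filter start-up transients do not bias the estimate, and the lag range is limited to less than two pitch periods so that the peak at twice the period cannot be selected.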
Keywords :
Auditory system; Frequency estimation; Frequency measurement; Frequency synchronization; Humans; Natural languages; Nerve fibers; Nonlinear filters; Speech analysis; Speech processing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
Type :
conf
DOI :
10.1109/ICASSP.1984.1172757
Filename :
1172757