DocumentCode
749485
Title
Perceptual phase quantization of speech
Author
Kim, Doh-Suk
Author_Institution
Human & Comput. Interaction Lab., Samsung Adv. Inst. of Technol., Kyonggi-do, South Korea
Volume
11
Issue
4
fYear
2003
fDate
7/1/2003 12:00:00 AM
Firstpage
355
Lastpage
364
Abstract
It is essential to incorporate perceptual characteristics of human hearing in modern speech/audio coding systems. However, the focus has been confined only to the magnitude information of speech, and little attention has been paid to phase information. A quantitative study on the characteristics of human phase perception is presented and a novel method is proposed for the quantization of phase information in speech/audio signals. First, the just-noticeable difference (JND) of phase for each harmonic in flat-spectrum periodic tones is measured for several different fundamental frequencies. Then, a mathematical model of JND is established, based on measured data, to form a weighting function for phase quantization. Since the proposed weighting function is derived from psychoacoustic measurements, it provides a novel quantization method by which more bits are assigned to perceptually important phase components at the sacrifice of less important ones, resulting in a quantized signal perceptually closer to the original one. Experimental results on five vowel speech signals demonstrate that the proposed weighting function is very effective for the quantization of phase information.
Keywords
harmonics; hearing; quantisation (signal); speech coding; audio coding; flat-spectrum periodic tones; harmonic; human hearing perceptual characteristics; just-noticeable difference; phase perception; psychoacoustic measurements; speech coding; speech quantization; weighting function; Audio coding; Auditory system; Decoding; Frequency measurement; Humans; Phase measurement; Quantization; Shape; Speech analysis; Speech coding;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/TSA.2003.814409
Filename
1214851
Link To Document