• DocumentCode
    749485
  • Title
    Perceptual phase quantization of speech
  • Author
    Kim, Doh-Suk
  • Author_Institution
    Human & Comput. Interaction Lab., Samsung Adv. Inst. of Technol., Kyonggi-do, South Korea
  • Volume
    11
  • Issue
    4
  • fYear
    2003
  • fDate
    7/1/2003
  • Firstpage
    355
  • Lastpage
    364
  • Abstract
    It is essential to incorporate perceptual characteristics of human hearing in modern speech/audio coding systems. However, the focus has been confined only to the magnitude information of speech, and little attention has been paid to phase information. A quantitative study on the characteristics of human phase perception is presented and a novel method is proposed for the quantization of phase information in speech/audio signals. First, the just-noticeable difference (JND) of phase for each harmonic in flat-spectrum periodic tones is measured for several different fundamental frequencies. Then, a mathematical model of JND is established, based on measured data, to form a weighting function for phase quantization. Since the proposed weighting function is derived from psychoacoustic measurements, it provides a novel quantization method by which more bits are assigned to perceptually important phase components at the sacrifice of less important ones, resulting in a quantized signal perceptually closer to the original one. Experimental results on five vowel speech signals demonstrate that the proposed weighting function is very effective for the quantization of phase information.
  • Keywords
    harmonics; hearing; quantisation (signal); speech coding; audio coding; flat-spectrum periodic tones; harmonic; human hearing perceptual characteristics; just-noticeable difference; phase perception; psychoacoustic measurements; speech quantization; weighting function; Auditory system; Decoding; Frequency measurement; Humans; Phase measurement; Quantization; Shape; Speech analysis
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1063-6676
  • Type
    jour
  • DOI
    10.1109/TSA.2003.814409
  • Filename
    1214851