• DocumentCode
    1798771
  • Title

    Comparison of two tonality estimation methods used in a psychoacoustic model

  • Author

    Hao Chen ; Taghipour, Armin ; Edler, Bernd

  • Author_Institution
    Int. Audio Labs. Erlangen, Erlangen, Germany
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    706
  • Lastpage
    710
  • Abstract
    Perceptual audio codecs apply psychoacoustic principles such as masking effects of the human auditory system in order to reduce irrelevancies in the input audio signal. Psychoacoustic studies show differences between masking strength of tonal and noise maskers: the masking effect of narrowband noise is stronger than that of a tone which has the same power and is placed in the center frequency of the noise. In this paper, two tonality estimation methods are discussed which are implemented in a filter bank based psychoacoustic model. The first method is called Partial Spectral Flatness Measure (PSFM) and the second is referred to as Amplitude Modulation Ratio (AM-R). The psychoacoustic model uses a set of complex band-pass filters. It was designed according to the temporal/spectral resolution of the human auditory system, and takes into account post masking as well as the spreading effect of individual local maskers in simultaneous masking. This paper describes the model, tonality estimation methods and their implementation. The estimators are compared to each other by subjective tests. The results are presented and discussed.
  • Keywords
    amplitude modulation; audio coding; channel bank filters; codecs; hearing; spectral analysis; AM-R; PSFM; amplitude modulation ratio; audio signal processing; complex band-pass filters; filter bank based psychoacoustic model; human auditory system; irrelevancy reduction; masking effect; narrowband noise; noise masker; partial spectral flatness measure; perceptual audio codecs; spectral resolution; spreading effect; temporal resolution; tonal masker; tonality estimation method; Audio coding; Codecs; Entropy; Estimation; Masking threshold; Noise; perceptual audio coding; perceptual model; psychoacoustic model; spectral flatness; tonality estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2014 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4799-3902-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2014.7009886
  • Filename
    7009886