DocumentCode
3069994
Title
Direct MDCT Domain Psychoacoustic Modeling
Author
Suresh, K. ; Sreenivas, T. V.
Author_Institution
Indian Inst. of Sci., Bangalore
fYear
2007
fDate
15-18 Dec. 2007
Firstpage
742
Lastpage
747
Abstract
We extend the recently proposed spectral integration based psychoacoustic model for sinusoidal distortions to the MDCT domain. The estimated masking threshold additionally depends on the sub-band spectral flatness measure of the signal, which accounts for the non-sinusoidal distortion introduced by masking. The expressions for the masking threshold are derived, and the validity of the proposed model is established through a perceptual transparency test of audio clips. Test results indicate that the new model achieves transparent quality reconstruction. The model is compared with the MPEG psychoacoustic models with respect to the estimated perceptual entropy (PE). The results show that the proposed model predicts a lower PE than the other models.
Keywords
audio coding; data compression; discrete cosine transforms; distortion; entropy codes; signal reconstruction; spectral analysis; MPEG psychoacoustic model; digital audio compression; direct MDCT domain psychoacoustic modeling; masking threshold estimation; perceptual entropy estimation; quality reconstruction; sinusoidal distortion; spectral integration based psychoacoustic model; subband spectral flatness measure; Audio coding; Auditory system; Distortion; Frequency domain analysis; Humans; Masking threshold; Psychoacoustic models; Psychology; Signal processing; Transform coding; Psychoacoustics; audio coding; masking threshold
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing and Information Technology, 2007 IEEE International Symposium on
Conference_Location
Giza
Print_ISBN
978-1-4244-1835-0
Electronic_ISBN
978-1-4244-1835-0
Type
conf
DOI
10.1109/ISSPIT.2007.4458108
Filename
4458108