DocumentCode
3397496
Title
Bit-rate reduction using psychoacoustical masking model in Frequency Domain Linear Prediction based Audio Codec
Author
Shi, Dong ; Xi, Du ; Ruimin, Hu
Author_Institution
Nat. Eng. Res. Center for Multimedia Software, Wuhan Univ., Wuhan, China
Volume
2
fYear
2010
fDate
30-31 May 2010
Firstpage
229
Lastpage
232
Abstract
Frequency Domain Linear Prediction (FDLP) approximates the Hilbert envelopes of a signal. An FDLP-based codec operates on long temporal segments and preserves the information carried by the time-domain envelopes well. The codec reconstructs the signal with good quality, but its coding efficiency is insufficient. Here, frequency masking is introduced into the FDLP-based codec to reduce the bit rate. Frequency masking is a hearing phenomenon in which the hearing threshold for a sound rises when an intense sound is present simultaneously. A psychoacoustic model is used to estimate the hearing threshold and the absolute threshold of hearing (ATH) for the FDLP carrier signals, and the bit allocation for the FDLP carrier signal in each frequency sub-band is calculated from the threshold and the ATH. Applying frequency masking yields a 5% bit-rate reduction. Perceptual Evaluation of Audio Quality (PEAQ) and Multiple Stimuli with Hidden Reference and Anchor (MUSHRA) tests are carried out to evaluate the performance.
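The threshold-driven bit allocation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the psychoacoustic model or the allocation rule. The sketch assumes Terhardt's widely used closed-form approximation of the ATH and the common rule of thumb of roughly 6.02 dB of signal-to-noise ratio per quantizer bit. The function names `ath_db` and `allocate_bits`, the `max_bits` cap, and the per-band inputs are hypothetical.

```python
import math


def ath_db(freq_hz):
    """Terhardt's approximation of the absolute threshold of hearing (dB SPL).

    Assumption: the abstract does not say which ATH formula the paper uses.
    """
    k = freq_hz / 1000.0  # frequency in kHz
    return (3.64 * k ** -0.8
            - 6.5 * math.exp(-0.6 * (k - 3.3) ** 2)
            + 1e-3 * k ** 4)


def allocate_bits(band_centers_hz, band_levels_db, mask_thresholds_db, max_bits=16):
    """Assign quantizer bits per sub-band from the signal-to-threshold ratio.

    Hypothetical rule: the effective threshold of a band is the larger of its
    masking threshold and the ATH at its center frequency. Bands at or below
    that threshold get 0 bits; the others get about one bit per 6.02 dB.
    """
    bits = []
    for freq, level, mask in zip(band_centers_hz, band_levels_db, mask_thresholds_db):
        threshold = max(mask, ath_db(freq))
        ratio_db = level - threshold
        if ratio_db <= 0:
            bits.append(0)  # inaudible band: spend no bits on it
        else:
            bits.append(min(max_bits, math.ceil(ratio_db / 6.02)))
    return bits
```

Under these assumptions, a sub-band whose level lies below either the masking threshold or the ATH receives no bits, which is where a bit-rate saving of the kind reported in the abstract would come from.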
Keywords
Auditory system; Bit rate; Codecs; Frequency domain analysis; Frequency estimation; Predictive models; Psychoacoustic models; Psychology; Radio spectrum management; Time domain analysis; Frequency Domain Linear Prediction (FDLP); audio coding; frequency masking; psychoacoustics model
fLanguage
English
Publisher
IEEE
Conference_Titel
Industrial Mechatronics and Automation (ICIMA), 2010 2nd International Conference on
Conference_Location
Wuhan, China
Print_ISBN
978-1-4244-7653-4
Type
conf
DOI
10.1109/ICINDMA.2010.5538328
Filename
5538328