DocumentCode :
2812029
Title :
Noise-to-mask ratio minimization by weighted non-negative matrix factorization
Author :
Nikunen, Joonas ; Virtanen, Tuomas
Author_Institution :
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
25
Lastpage :
28
Abstract :
This paper proposes a novel algorithm for minimizing the perceptual distortion in non-negative matrix factorization (NMF) based audio representation. We formulate the noise-to-mask ratio audio quality criterion in a form where it can be used in NMF and propose an algorithm for optimizing the criterion. We also propose a method for compensating the spreading of the representation error in the synthesis filterbank. The objective perceptual quality produced by the proposed method is found to outperform all the reference methods. We also study the trade-off between the window length and the rank of factorization with a fixed data rate, and find that the best performance is obtained with window lengths between 10 and 30 ms.
Keywords :
audio signal processing; channel bank filters; matrix decomposition; signal representation; audio representation; noise-to-mask ratio minimization; perceptual distortion; representation error; synthesis filterbank; weighted non-negative matrix factorization; window length; Acoustic noise; Audio coding; Filter bank; Masking threshold; Noise measurement; Nuclear magnetic resonance; Psychoacoustic models; Signal processing; Signal processing algorithms; Signal to noise ratio; Audio coding; Noise-to-mask ratio; Non-negative matrix factorization; Signal representations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5496264
Filename :
5496264
Link To Document :
بازگشت