DocumentCode
1652699
Title
Multichannel object-based audio coding with controllable quality
Author
Gorlow, Stanislaw ; Habets, Emanuel A. P. ; Marchand, Sylvain
Author_Institution
LaBRI, Univ. Bordeaux, Talence, France
fYear
2013
Firstpage
561
Lastpage
565
Abstract
In this paper a new multichannel object-based audio coding scheme with scalable signal quality is proposed. The novel scheme is based on controlled downmixing and demixing. By means of a dedicated control mechanism, a number of distinct audio objects are mixed into a lower number of channels. The latter is chosen such that the desired quality level is met after demixing. The quality is assessed with two new psychoacoustically motivated metrics. Following the informed source separation approach, the downmix is decomposed via optimum spatial filtering guided by short-time power spectral densities of the audio objects. In an experiment it is shown that the raw data rate of an exemplary 10-track recording can be reduced by at least 30 % using linear pulse-code modulation while maintaining perceptual transparency.
Keywords
audio coding; blind source separation; pulse code modulation; quality control; spatial filters; spectral analysis; controllable quality; controlled demixing; controlled downmixing; dedicated control mechanism; distinct audio objects; linear pulse-code modulation; multichannel object-based audio coding scheme; optimum spatial filtering; perceptual transparency; psychoacoustically motivated metrics; quality level; scalable signal quality; short-time power spectral density; source separation approach; Arrays; Covariance matrices; Decoding; Encoding; Measurement; Signal to noise ratio; Source separation; Audio coding; multichannel; object-based; quality control; spatial filtering;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6637710
Filename
6637710
Link To Document