DocumentCode :
59908
Title :
Nonparametric Bayesian Dereverberation of Power Spectrograms Based on Infinite-Order Autoregressive Processes
Author :
Maezawa, Akira ; Itoyama, Katsutoshi ; Yoshii, Kazutomo ; Okuno, Hiroshi G.
Author_Institution :
Yamaha Corp., Shizuoka, Japan
Volume :
22
Issue :
12
fYear :
2014
fDate :
Dec. 2014
Firstpage :
1918
Lastpage :
1930
Abstract :
This paper describes a monaural audio dereverberation method that operates in the power spectrogram domain. The method is robust to different kinds of source signals such as speech or music. Moreover, it requires little manual intervention, including the complexity of room acoustics. The method is based on a non-conjugate Bayesian model of the power spectrogram. It extends the idea of multi-channel linear prediction to the power spectrogram domain, and formulates a model of reverberation as a non-negative, infinite-order autoregressive process. To this end, the power spectrogram is interpreted as a histogram count data, which allows a nonparametric Bayesian model to be used as the prior for the autoregressive process, allowing the effective number of active components to grow, without bound, with the complexity of data. In order to determine the marginal posterior distribution, a convergent algorithm, inspired by the variational Bayes method, is formulated. It employs the minorization-maximization technique to arrive at an iterative, convergent algorithm that approximates the marginal posterior distribution. Both objective and subjective evaluations show advantage over other methods based on the power spectrum. We also apply the method to a music information retrieval task and demonstrate its effectiveness.
Keywords :
Bayes methods; architectural acoustics; audio signal processing; autoregressive processes; iterative methods; nonparametric statistics; optimisation; prediction theory; reverberation; histogram count data; iterative convergent algorithm; marginal posterior distribution; minorization-maximization technique; monaural audio dereverberation method; multichannel linear prediction; music information retrieval task; nonconjugate Bayesian model; nonnegative infinite-order autoregressive process; nonparametric Bayesian dereverberation; power spectrogram domain; reverberation model; room acoustics; source signal; variational Bayes method; Complexity theory; Data models; Reverberation; Spectrogram; Speech; Speech processing; Dereverberation; minorization maximization; nonparameteric Bayes;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2014.2355772
Filename :
6894190
Link To Document :
بازگشت