مرکز منطقه ای اطلاع رساني علوم و فناوري - Gamma Markov Random Fields for Audio Source Modeling

DocumentCode :

1365663

Title :

Gamma Markov Random Fields for Audio Source Modeling

Author :

Dikmen, Onur ; Cemgil, A. Taylan

Author_Institution :

Comput. Eng. Dept., Bogazici Univ., Istanbul, Turkey

Volume :

Issue :

fYear :

2010

fDate :

3/1/2010 12:00:00 AM

Firstpage :

589

Lastpage :

601

Abstract :

In many audio processing tasks, such as source separation, denoising or compression, it is crucial to construct realistic and flexible models to capture the physical properties of audio signals. This can be accomplished in the Bayesian framework through the use of appropriate prior distributions. In this paper, we describe a class of prior models called Gamma Markov random fields (GMRFs) to model the sparsity and the local dependency of the energies (i.e., variances) of time-frequency expansion coefficients. A GMRF model describes a non-normalised joint distribution over unobserved variance variables, where given the field the actual source coefficients are independent. Our construction ensures a positive coupling between the variance variables, so that signal energy changes smoothly over both axes to capture the temporal and spectral continuity. The coupling strength is controlled by a set of hyperparameters. Inference on the overall model is convenient because of the conditional conjugacy of all of the variables in the model, but automatic optimization of hyperparameters is crucial to obtain better fits. The marginal likelihood of the model is not available because of the intractable normalizing constant of GMRFs. In this paper, we optimize the hyperparameters of our GMRF-based audio model using contrastive divergence and compare this method to alternatives such as score matching and pseudolikelihood maximization where applicable. We present the performance of the GMRF models in denoising and single-channel source separation problems in completely blind scenarios, where all the hyperparameters are jointly estimated given only audio data.

Keywords :

Markov processes; audio signal processing; signal denoising; source separation; Bayesian framework; Gamma Markov random fields; Gibbs sampling; audio signals; audio source modeling; nonnormalised joint distribution; signal denoising; single-channel source separation problems; Audio modeling; Gibbs sampling; Markov random fields; contrastive divergence; denoising; pseudolikelihood; score matching; single-channel source separation;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2009.2031778

Filename :

5233871

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1365663