Title :
An Invariant Bayesian Model Selection Principle for Gaussian Data in a Sparse Representation
Author :
Fossgaard, Eirik ; Flå, Tor
Author_Institution :
Dept. of Chem., Tromso Univ.
Abstract :
In this paper, we develop a code length principle which is invariant to the choice of parameterization on the model distributions, that is the code length remains the same under smooth transformations on the likelihood parameters. An invariant approximation formula for easy computation of the marginal distribution is provided for Gaussian likelihood models. We provide invariant estimators of the model parameters and formulate conditions under which these estimators are essentially posteriori unbiased for Gaussian models. An upper bound on the coarseness of discretization on the model parameters is deduced. We introduce a discrimination measure between probability distributions and use it to construct probability distributions on model classes and show how this may induce an additional code length term k/4log2k for a k-parameter model. The total code length is shown to be closely related to the normalized maximum likelihood (NML) code length of Rissanen when choosing Jeffreys prior distribution on the model parameters together with a uniform prior distribution on the model classes. Our model selection principle is applied to a Gaussian estimation problem for data in a wavelet representation and its performance is tested and compared to alternative wavelet-based estimation methods in numerical experiments
Keywords :
Bayes methods; Gaussian processes; approximation theory; image denoising; image representation; maximum likelihood estimation; probability; wavelet transforms; Gaussian data; Gaussian estimation; Jeffreys prior distribution; approximation formula; invariant Bayesian model; marginal distribution; maximum likelihood code length principle; probability distribution; wavelet representation; Bayesian methods; Distributed computing; Gaussian distribution; Length measurement; Maximum likelihood estimation; Noise reduction; Pollution measurement; Probability distribution; Testing; Upper bound; Compression; denoising; generalized Gaussian distribution; invariance Laplace approximation; minimum description length (MDL); minimum message length (MML); model class prior; natural images; statistical estimation; wavelet representation;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2006.878170