DocumentCode :
3239370
Title :
Optimal Bayesian MMSE estimation of the coefficient of determination for discrete prediction
Author :
Ting Chen ; Braga-Neto, Ulisses
Author_Institution :
Dept. of Electr. Eng., Texas A & M Univ., College Station, TX, USA
fYear :
2013
fDate :
17-19 Nov. 2013
Firstpage :
66
Lastpage :
69
Abstract :
The coefficient of determination (CoD) has significant applications in genomics, for example, in the inference of gene regulatory networks. In previous publications, we have studied several nonparametric CoD estimators, based upon the resubstitution, leave-one-out, cross-validation, and bootstrap error estimators, and one parametric maximum-likelihood (ML) CoD estimator that allows the incorporation of available prior knowledge, from a frequentist perspective. However, none of these CoD estimators are rigorously optimized based on statistical inference across a family of possible distributions. Therefore, by following the idea of Bayesian error estimation for classification, we define a Bayesian CoD estimator that minimizes the mean-square error (MSE), based on a parametrized family of joint distributions between predictors and target as a function of random parameters characterized by assumed prior distributions. We derive an exact formulation of the sample-based Bayesian MMSE CoD estimator. Numerical experiments are carried out to estimate performance metrics of the Bayesian CoD estimator and compare them against those of resubstitution, leave-one-out, bootstrap and cross-validation CoD estimators over all the distributions, by employing the Monte Carlo sample method. Results show that the Bayesian CoD estimator has the best performance, displaying zero bias, small variance, and least root mean-square error (RMS).
Keywords :
Bayes methods; Monte Carlo methods; error analysis; genetics; genomics; inference mechanisms; least mean squares methods; maximum likelihood estimation; minimisation; random processes; statistical distributions; Monte Carlo sample method; bootstrap CoD estimators; coefficient of determination; cross-validation CoD estimators; discrete prediction; gene regulatory network inference; genomics; least root mean-square error; leave-one-out CoD estimators; minimization; nonparametric CoD estimators; numerical experiments; parametric maximum-likelihood CoD estimator; random parameter function; resubstitution CoD estimators; sample-based Bayesian MMSE CoD estimator; statistical inference; Bayes methods; Maximum likelihood estimation; Measurement; Monte Carlo methods; Tin; Vectors; Bayesian Estimation; Coefficient of Determination; Discrete Prediction; Minimum Mean-Square Error;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics (GENSIPS), 2013 IEEE International Workshop on
Conference_Location :
Houston, TX
Print_ISBN :
978-1-4799-3461-4
Type :
conf
DOI :
10.1109/GENSIPS.2013.6735933
Filename :
6735933
Link To Document :
بازگشت