• Title of article

    Bayesian Shrinkage Estimation of the Relative Abundance of mRNA Transcripts Using SAGE

  • Author/Authors

    J.S.، Morris نويسنده , , K.A.، Baggerly نويسنده , , K.R.، Coombes نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2003
  • Pages
    -475
  • From page
    476
  • To page
    0
  • Abstract
    Serial analysis of gene expression (SAGE) is a technology for quantifying gene expression in biological tissue that yields count data that can be modeled by a multinomial distribution with two characteristics: skewness in the relative frequencies and small sample size relative to the dimension. As a result of these characteristics, a given SAGE sample may fail to capture a large number of expressed mRNA species present in the tissue. Empirical estimators of mRNA speciesʹ relative abundance effectively ignore these missing species, and as a result tend to overestimate the abundance of the scarce observed species comprising a vast majority of the total. We have developed a new Bayesian estimation procedure that quantifies our prior information about these characteristics, yielding a nonlinear shrinkage estimator with efficiency advantages over the MLE. Our prior is mixture of Dirichlets, whereby species are stochastically partitioned into abundant and scarce classes, each with its own multivariate prior. Simulation studies reveal our estimator has lower integrated mean squared error (IMSE) than the MLE for the SAGE scenarios simulated, and yields relative abundance profiles closer in Euclidean distance to the truth for all samples simulated. We apply our method to a SAGE library of normal colon tissue, and discuss its implications for assessing differential expression.
  • Keywords
    bioinformatics , sage , Shrinkage estimators , Multinomial distribution , mixture distributions , Bayesian methods
  • Journal title
    BIOMETRICS (BIOMETRIC SOCIETY)
  • Serial Year
    2003
  • Journal title
    BIOMETRICS (BIOMETRIC SOCIETY)
  • Record number

    84154