• Title of article

    Variable Selection and Joint Estimation of Mean and Covariance Models with an Application to eQTL Data

  • Author/Authors

    Lee, JungJun Department of Statistics - Korea University - Seoul, Republic of Korea , Kim, SungHwan Department of Applied Statistics - Konkuk University - Seoul, Republic of Korea , Jhong, Jae-Hwan Department of Statistics - Korea University - Seoul, Republic of Korea , Koo, Ja-Yong Department of Statistics - Korea University - Seoul, Republic of Korea

  • Pages
    13
  • From page
    1
  • To page
    13
  • Abstract
    In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To this end, the modifed Cholesky decomposition is utilized to parametrize entries of a precision matrix. Te JMCCM maximizes the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is efectively achieved via the proposed variable selection scheme and contributes to exploring signifcant hub genes shown to be concordant to a priori biological evidence. In simulation studies, we confrm that our model selection efciently identifes the true underlying networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway.
  • Keywords
    eQTL , Estimation , Covariance , GCV
  • Journal title
    Computational and Mathematical Methods in Medicine
  • Serial Year
    2018
  • Record number

    2610576