• DocumentCode
    2498071
  • Title

    An MM-Based Optimization Algorithm for Sparse Linear Modeling on Microarray Data Analysis

  • Author

    Chen, Xiaohui ; Gottardo, Raphael

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of British Columbia, Vancouver, BC, Canada
  • fYear
    2009
  • fDate
    11-13 June 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Sparsity is crucial for high-dimensional statistical modeling. On one hand, dimensionality reduction can reduce the variability of estimation and thus provide reliable predictive power. On the other hand, the selected sub-model can discover and emphasize the underlying dependencies, which is useful for objective interpretation. Many variable selection methods have been proposed in literatures. For a prominent example, Least Absolute Shrinkage and Selection Operator (lasso) in linear regression context has been extensively explored. This paper discusses a class of scaled mixture of Gaussian models from both a penalized likelihood and a Bayesian regression point of view. We propose an Majorize-Minimize (MM) algorithm to find the Maximum A Posteriori (MAP) estimator, where the EM algorithm can be stuck at local optimum for some members in this class. Simulation studies show the outperformance of proposed algorithm in nonstochastic design variable selection scenario. The proposed algorithm is applied to a real large-scale E.coli data set with known bona fide interactions for constructing sparse gene regulatory networks. We show that our regression networks with a properly chosen prior can perform comparably to state-of-the-art regulatory network construction algorithms.
  • Keywords
    Bayes methods; bioinformatics; data reduction; genomics; maximum likelihood estimation; microorganisms; optimisation; Bayesian regression approach; Gaussian models; dimensionality reduction; high dimensional statistical modeling; large scale E.coli data set; majorize-minimize based optimization algorithm; maximum a posteriori estimator; microarray data analysis; penalized likelihood approach; sparse gene regulatory networks; sparse linear modeling; variable selection methods; Algorithm design and analysis; Bayesian methods; Cancer; Data analysis; Input variables; Large-scale systems; Linear regression; Predictive models; Regulators; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-2901-1
  • Electronic_ISBN
    978-1-4244-2902-8
  • Type

    conf

  • DOI
    10.1109/ICBBE.2009.5162334
  • Filename
    5162334