• DocumentCode
    1515507
  • Title

    Sticky Hidden Markov Modeling of Comparative Genomic Hybridization

  • Author

    Du, Lan ; Chen, Minhua ; Lucas, Joseph ; Carin, Lawrence

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Duke Univ., Durham, NC, USA
  • Volume
    58
  • Issue
    10
  • fYear
    2010
  • Firstpage
    5353
  • Lastpage
    5368
  • Abstract
    We develop a sticky hidden Markov model (HMM) with a Dirichlet distribution (DD) prior, motivated by the problem of analyzing comparative genomic hybridization (CGH) data. As formulated the sticky DD-HMM prior is employed to infer the number of states in an HMM, while also imposing state persistence. The form of the proposed hierarchical model allows efficient variational Bayesian (VB) inference, of interest for large-scale CGH problems. We compare alternative formulations of the sticky HMM, while also examining the relative efficacy of VB and Markov chain Monte Carlo (MCMC) inference. To validate the formulation, example results are presented for an illustrative synthesized data set and our main application-CGH, for which we consider data for breast cancer. For the latter, we also make comparisons and partially validate the CGH analysis through factor analysis of associated (but distinct) gene-expression data.
  • Keywords
    DNA; Monte Carlo methods; biocomputing; genomics; hidden Markov models; DNA copy number; Dirichlet distribution; Markov chain Monte Carlo inference; breast cancer; comparative genomic hybridization data analysis; factor analysis; gene-expression data; hidden Markov modeling; hierarchical model; sticky DD-HMM prior; variational Bayesian inference; Bayesian methods; Bioinformatics; Biological cells; DNA; Data analysis; Genomics; Hidden Markov models; Matrix decomposition; Permission; Predictive models; DNA copy number; hidden Markov model (HMM); hierarchical Bayesian modeling; multi-task learning (MTL); variational Bayes (VB);
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/TSP.2010.2053033
  • Filename
    5484506