• DocumentCode
    3743660
  • Title

    Integrated analysis of multiple high-dimensional data sets by joint rank-1 matrix approximations

  • Author

    Ashkan Zeinalzadeh;Tom Wenska;Gordon Okimoto

  • Author_Institution
    Cancer Center, University of Hawaii at Manoa, USA
  • fYear
    2015
  • Firstpage
    3852
  • Lastpage
    3857
  • Abstract
    In this work, we developed an algorithm for the integrated analysis of multiple high-dimensional data matrices based on sparse rank-one matrix approximations. The algorithm approximates multiple data matrices with rank one outer products composed of sparse left singular-vectors that are unique to each matrix and a right singular-vector that is shared by all of the data matrices. The right-singular vector represents a signal we wish to detect in the row-space of each matrix. The non-zero components of the resulting left-singular vectors identify rows of each matrix that in aggregate provide a sparse linear representation of the shared right-singular vector. This sparse representation facilitates downstream interpretation and validation of the resulting model based on the rows selected from each matrix. False discovery rate is used to select an appropriate ℓ1 penalty parameter that imposes sparsity on the left singular-vector but not the common right singular-vector of the joint approximation. Since a given multi-modal data set (MMDS) may contain multiple signals of interest the algorithm is iteratively applied to the residualized version of original data to sequentially capture and model each distinct signal in terms of rows from the different matrices. We show that the algorithm outperforms standard singular value decomposition over a wide range of simulation scenarios in terms of detection accuracy. Analysis of real data for ovarian and liver cancer resulted in compact gene expression signatures that were predictive of clinical outcomes and highly enriched for cancer related biology.
  • Keywords
    "Sparse matrices","Detectors","Standards","Approximation algorithms","Matrix decomposition","Algorithm design and analysis","Cancer"
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2015 IEEE 54th Annual Conference on
  • Type

    conf

  • DOI
    10.1109/CDC.2015.7402818
  • Filename
    7402818