• DocumentCode
    1989212
  • Title
    Distance Preserving Dimension Reduction Using the QR Factorization or the Cholesky Factorization
  • Author
    Kim, Hyunsoo; Park, Haesun; Zha, Hongyuan
  • Author_Institution
    Coll. of Comput., Atlanta
  • fYear
    2007
  • fDate
    14-17 Oct. 2007
  • Firstpage
    263
  • Lastpage
    269
  • Abstract
    Dimension reduction plays an important role in handling massive quantities of high-dimensional data such as biomedical text data, gene expression data, and mass spectrometry data. In this paper, we introduce distance preserving dimension reduction (DPDR) based on the QR factorization (DPDR/QR) or the Cholesky factorization (DPDR/C). DPDR generates lower-dimensional representations of high-dimensional data that exactly preserve the Euclidean distances and cosine similarities between any pair of data points in the original space. After projecting the data points to the lower-dimensional space obtained from DPDR, one can run other data analysis algorithms on the reduced data. DPDR can substantially reduce the computing time and/or memory requirement of a given data analysis algorithm, especially when the algorithm must be run many times to estimate parameters or to search for a better solution.
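    The idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes the data points are the columns of an m x n matrix A with m >= n and, for the Cholesky variant, that A has full column rank. The function names (`dpdr_qr`, `dpdr_cholesky`, `pairwise_distances`, `cosine_similarities`) are hypothetical and chosen here for illustration only. Because A = QR with orthonormal columns in Q, and likewise AᵀA = RᵀR for the Cholesky factor, the n x n factor R has the same Gram matrix as A, so distances and cosine similarities between columns are unchanged.

```python
import numpy as np


def dpdr_qr(A):
    """Return the triangular factor R of A = QR (assumed sketch of DPDR/QR).

    Columns of A (m x n, m >= n) are data points; columns of R (n x n)
    are their reduced representations.
    """
    # mode="r" computes only R and skips forming Q explicitly.
    return np.linalg.qr(A, mode="r")


def dpdr_cholesky(A):
    """Return R with A^T A = R^T R (assumed sketch of DPDR/C).

    Requires A to have full column rank so that A^T A is positive definite.
    """
    # NumPy returns the lower-triangular factor L with A^T A = L L^T,
    # so the upper-triangular R is its transpose.
    L = np.linalg.cholesky(A.T @ A)
    return L.T


def pairwise_distances(X):
    """Euclidean distances between all pairs of columns of X."""
    diff = X[:, :, None] - X[:, None, :]
    return np.linalg.norm(diff, axis=0)


def cosine_similarities(X):
    """Cosine similarities between all pairs of columns of X."""
    norms = np.linalg.norm(X, axis=0)
    return (X.T @ X) / np.outer(norms, norms)
```

    A typical use would be to compute `R = dpdr_qr(A)` once and then pass the n-dimensional columns of `R`, in place of the m-dimensional columns of `A`, to any downstream algorithm that depends only on distances or cosine similarities.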
  • Keywords
    biology computing; data analysis; medical computing; parameter estimation; Cholesky factorization; Euclidean distances; QR factorization; biomedical text data; cosine similarities; data analysis algorithms; distance preserving dimension reduction; gene expression data; mass spectrometry data; Biomedical computing; Biomedical imaging; Drives; Educational institutions; Gene expression; Mass spectroscopy; Principal component analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-1509-0
  • Type
    conf
  • DOI
    10.1109/BIBE.2007.4375575
  • Filename
    4375575