• DocumentCode
    1080265
  • Title

    Multiple Peak Alignment in Sequential Data Analysis: A Scale-Space-Based Approach

  • Author

    Weichuan Yu ; Xiaoye Li ; Junfeng Liu ; Baolin Wu ; Williams, K.R. ; Hongyu Zhao

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol.
  • Volume
    3
  • Issue
    3
  • fYear
    2006
  • Firstpage
    208
  • Lastpage
    219
  • Abstract
    In this paper, we address the multiple peak alignment problem in sequential data analysis with an approach based on the Gaussian scale-space theory. We assume that multiple sets of detected peaks are the observed samples of a set of common peaks. We also assume that the locations of the observed peaks follow unimodal distributions (e.g., normal distribution) with their means equal to the corresponding locations of the common peaks and variances reflecting the extension of their variations. Under these assumptions, we convert the problem of estimating locations of the unknown number of common peaks from multiple sets of detected peaks into a much simpler problem of searching for local maxima in the scale-space representation. The optimization of the scale parameter is achieved using an energy minimization approach. We compare our approach with a hierarchical clustering method using both simulated data and real mass spectrometry data. We also demonstrate the merit of extending the binary peak detection method (i.e., a candidate is considered either as a peak or as a nonpeak) with a quantitative scoring measure-based approach (i.e., we assign to each candidate a possibility of being a peak)
  • Keywords
    Gaussian processes; biology computing; mass spectroscopy; minimisation; molecular biophysics; proteins; Gaussian scale-space theory; binary peak detection method; energy minimization; hierarchical clustering method; mass spectrometry; multiple peak alignment; optimization; scale-space representation; scale-space-based approach; sequential data analysis; Biomarkers; Cancer; Data analysis; Diseases; Gaussian distribution; Instruments; Mass spectroscopy; Peptides; Proteins; Systematics; Biomarker discovery; energy minimization; multiple peak alignment; parameter optimization.; peak identification; prior information; scale-space; Algorithms; Amino Acid Sequence; Molecular Sequence Data; Proteins; Sequence Alignment; Sequence Analysis, Protein; Sequence Homology, Amino Acid;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2006.41
  • Filename
    1668020