• DocumentCode
    2156337
  • Title

    On Preprocessing of SELDI-MS Data and its Evaluation

  • Author

    Prados, Julien ; Kalousis, Alexandros ; Hilario, Melanie

  • Author_Institution
    Geneva Artificial Intelligence Laboratory, Geneva Univ.
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    953
  • Lastpage
    958
  • Abstract
    Mass spectrometry is becoming an important tool in proteomics. Mass spectral data are characterized by very high dimensionality and a high level of redundancy. Both issues are quite challenging when one wants to perform knowledge discovery and push existing tools to their limits. We tackle both via a preprocessing pipeline that drastically reduces dimensionality and redundancy of the initial representation in order to focus on biologically relevant information. Essentially preprocessing performs feature extraction in a manner that reflects domain knowledge. We propose a framework for the evaluation of the given pipeline and in fact of any mass spectrometry preprocessing pipeline which is based on the level of conservation of discriminatory information. The discriminatory information content of a given representation is objectively measured by the classification performance of a number of classification algorithms evaluated on the given representation. This approach also allows us to compare a number of different preprocessing possibilities, namely using peak intensities vs peak areas to represent peaks and how non observed peaks should be treated, and demonstrate which is the most informative one
  • Keywords
    data mining; feature extraction; mass spectra; medical signal processing; molecular biophysics; proteins; SELDI-MS data preprocessing; discriminatory information; feature extraction; knowledge discovery; mass spectra; mass spectrometry; proteomics; Biomedical signal processing; Classification algorithms; Data mining; Diseases; Feature extraction; Mass spectroscopy; Pipelines; Proteins; Proteomics; Signal processing algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer-Based Medical Systems, 2006. CBMS 2006. 19th IEEE International Symposium on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1063-7125
  • Print_ISBN
    0-7695-2517-1
  • Type

    conf

  • DOI
    10.1109/CBMS.2006.122
  • Filename
    1647693