DocumentCode :
2156337
Title :
On Preprocessing of SELDI-MS Data and its Evaluation
Author :
Prados, Julien ; Kalousis, Alexandros ; Hilario, Melanie
Author_Institution :
Geneva Artificial Intelligence Laboratory, Geneva Univ.
fYear :
0
fDate :
0-0 0
Firstpage :
953
Lastpage :
958
Abstract :
Mass spectrometry is becoming an important tool in proteomics. Mass spectral data are characterized by very high dimensionality and a high level of redundancy. Both issues are quite challenging when one wants to perform knowledge discovery and push existing tools to their limits. We tackle both via a preprocessing pipeline that drastically reduces dimensionality and redundancy of the initial representation in order to focus on biologically relevant information. Essentially preprocessing performs feature extraction in a manner that reflects domain knowledge. We propose a framework for the evaluation of the given pipeline and in fact of any mass spectrometry preprocessing pipeline which is based on the level of conservation of discriminatory information. The discriminatory information content of a given representation is objectively measured by the classification performance of a number of classification algorithms evaluated on the given representation. This approach also allows us to compare a number of different preprocessing possibilities, namely using peak intensities vs peak areas to represent peaks and how non observed peaks should be treated, and demonstrate which is the most informative one
Keywords :
data mining; feature extraction; mass spectra; medical signal processing; molecular biophysics; proteins; SELDI-MS data preprocessing; discriminatory information; feature extraction; knowledge discovery; mass spectra; mass spectrometry; proteomics; Biomedical signal processing; Classification algorithms; Data mining; Diseases; Feature extraction; Mass spectroscopy; Pipelines; Proteins; Proteomics; Signal processing algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems, 2006. CBMS 2006. 19th IEEE International Symposium on
Conference_Location :
Salt Lake City, UT
ISSN :
1063-7125
Print_ISBN :
0-7695-2517-1
Type :
conf
DOI :
10.1109/CBMS.2006.122
Filename :
1647693
Link To Document :
بازگشت