DocumentCode
2771537
Title
Prediction of labor for pregnant women using high-resolution mass spectrometry data
Author
Oh, Jung Hun ; Nandi, Animesh ; Gurnani, Prem ; Bryant-Greenwood, Peter ; Rosenblatt, Kevin P. ; Gao, Jean
Author_Institution
Dept. of Comput. Sci. Eng., Texas Univ., Arlington, TX
fYear
2006
fDate
16-18 Oct. 2006
Firstpage
332
Lastpage
339
Abstract
High-resolution MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry has shown promise as a screening tool for detecting discriminatory protein patterns. The major computational obstacle in analyzing MALDI-TOF data is a large number of mass/charge peaks (a.k.a. features, data points). With the number of data points easily going beyond one million for a single sample, efficient feature selection is critical for unequivocal protein pattern discovery. To tackle this problem, we have developed a multi-step strategy for data preprocessing and afterwards feature selection. The preprocessing is composed of binning, baseline correction, and normalization. For the preprocessed data, we propose a new feature subset selection method that is a hybrid filter/wrapper approach. Based on the two feature subsets for each feature, high and low correlated subsets, a feature is assigned a weight which indicates the extent of feature importance. Our scheme is applied to the analysis of labor dataset to predict delivery time of pregnant women. To validate the performance of the proposed algorithm, experiments are performed in comparison with other feature selection and classification methods. We show that our proposed approach outperforms other algorithms
Keywords
biochemistry; biomedical measurement; feature extraction; laser applications in medicine; mass spectroscopic chemical analysis; medical computing; molecular biophysics; obstetrics; pattern classification; photoionisation; photon stimulated desorption; proteins; time of flight mass spectroscopy; classification methods; data preprocessing; feature subset selection method; high-resolution time-of-flight mass spectrometry data; hybrid filter; hybrid wrapper approach; labor prediction; matrix-assisted laser desorption; matrix-assisted laser ionization; pregnant women; protein patterns; unequivocal protein pattern discovery; Biomarkers; Cancer; Data analysis; Diseases; Ionization; Mass spectroscopy; Pathology; Pregnancy; Proteins; Proteomics;
fLanguage
English
Publisher
ieee
Conference_Titel
BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
Conference_Location
Arlington, VA
Print_ISBN
0-7695-2727-2
Type
conf
DOI
10.1109/BIBE.2006.253298
Filename
4019678
Link To Document