• DocumentCode
    2771537
  • Title

    Prediction of labor for pregnant women using high-resolution mass spectrometry data

  • Author

    Oh, Jung Hun ; Nandi, Animesh ; Gurnani, Prem ; Bryant-Greenwood, Peter ; Rosenblatt, Kevin P. ; Gao, Jean

  • Author_Institution
    Dept. of Comput. Sci. Eng., Texas Univ., Arlington, TX
  • fYear
    2006
  • fDate
    16-18 Oct. 2006
  • Firstpage
    332
  • Lastpage
    339
  • Abstract
    High-resolution MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry has shown promise as a screening tool for detecting discriminatory protein patterns. The major computational obstacle in analyzing MALDI-TOF data is a large number of mass/charge peaks (a.k.a. features, data points). With the number of data points easily going beyond one million for a single sample, efficient feature selection is critical for unequivocal protein pattern discovery. To tackle this problem, we have developed a multi-step strategy for data preprocessing and afterwards feature selection. The preprocessing is composed of binning, baseline correction, and normalization. For the preprocessed data, we propose a new feature subset selection method that is a hybrid filter/wrapper approach. Based on the two feature subsets for each feature, high and low correlated subsets, a feature is assigned a weight which indicates the extent of feature importance. Our scheme is applied to the analysis of labor dataset to predict delivery time of pregnant women. To validate the performance of the proposed algorithm, experiments are performed in comparison with other feature selection and classification methods. We show that our proposed approach outperforms other algorithms
  • Keywords
    biochemistry; biomedical measurement; feature extraction; laser applications in medicine; mass spectroscopic chemical analysis; medical computing; molecular biophysics; obstetrics; pattern classification; photoionisation; photon stimulated desorption; proteins; time of flight mass spectroscopy; classification methods; data preprocessing; feature subset selection method; high-resolution time-of-flight mass spectrometry data; hybrid filter; hybrid wrapper approach; labor prediction; matrix-assisted laser desorption; matrix-assisted laser ionization; pregnant women; protein patterns; unequivocal protein pattern discovery; Biomarkers; Cancer; Data analysis; Diseases; Ionization; Mass spectroscopy; Pathology; Pregnancy; Proteins; Proteomics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
  • Conference_Location
    Arlington, VA
  • Print_ISBN
    0-7695-2727-2
  • Type

    conf

  • DOI
    10.1109/BIBE.2006.253298
  • Filename
    4019678