Title :
A neural network approach to multi-biomarker panel development based on LC/MS/MS proteomics profiles: A case study in breast cancer
Author :
Zhang, Fan ; Chen, Jake Y.
Author_Institution :
Sch. of Inf., Indiana Univ., Indianapolis, IN, USA
Abstract :
Liquid chromatography tandem mass spectrometry (LC/MS/MS) based plasma proteomics profiling technique is a promising technology platform to study candidate protein biomarkers for complex human diseases such as cancer. Factors such as inherent variability, protein detectability limitation, and peptide discovery biases among LC/MS/MS platforms have made the classification and prediction of proteomics profiles challenging. In this paper, we developed a proteomics data analysis method to identify multi-protein biomarker panels for breast cancer diagnosis based on artificial neural networks. Using this method, we first applied standard analysis of variance (ANOVA) to derive a list of single candidate biomarkers that significantly changed from plasma proteomics profiles between breast cancer and controls. Next, we constructed a feed forward neural network (FFNN) for each combination of single marker proteins and trained with plasma proteomics results derived from 40 breast cancer women and 40 control women. We evaluated the results for best five-marker panel and ten-marker panels on a testing data set of similar cohort of 80 plasma proteomics profiles, of which half are breast cancer women and half are controls, using both statistical methods (receiver operating characteristics curve comparisons) and biological literature validation. We found that five-marker panel using two-variable FFNN output achieved the best prediction performance in testing data set, with 82.5% in sensitivity and 82.5% in specificity. Our computational method can serve as a general model for multi-biomarker panel discovery applications in other diseases.
Keywords :
biological organs; cancer; chromatography; data analysis; feedforward neural nets; mammography; mass spectroscopic chemical analysis; medical diagnostic computing; proteins; proteomics; statistical analysis; tumours; ANOVA; FFNN; LC/MS/MS plasma proteomics profiling technique; applied standard analysis of variance; artificial neural network approach; biological literature validation; breast cancer diagnosis; computational method; feed forward neural network; human diseases; liquid chromatography tandem mass spectrometry; multibiomarker panel development; multibiomarker panel discovery applications; protein biomarkers; protein detectability limitation; proteomics data analysis method; receiver operating characteristics curve comparisons; single marker proteins; statistical methods; two-variable FFNN output; Analysis of variance; Artificial neural networks; Biomarkers; Breast cancer; Diseases; Neural networks; Plasmas; Proteins; Proteomics; Testing;
Conference_Titel :
Computer-Based Medical Systems, 2009. CBMS 2009. 22nd IEEE International Symposium on
Conference_Location :
Albuquerque, NM
Print_ISBN :
978-1-4244-4879-1
Electronic_ISBN :
1063-7125
DOI :
10.1109/CBMS.2009.5255456