Title :
Sparse representation based feature selection for mass spectrometry data
Author :
Ke, Jiqing ; Zhu, Lei ; Han, Bin ; Dai, Qi ; Wang, Yaojia ; Li, Lihua ; Xu, Shenhua ; Mou, Hanzhou ; Zheng, Zhiguo
Author_Institution :
Inst. of Biomed. Eng. & Instrum., Hangzhou Dianzi Univ., Hangzhou, China
Abstract :
Mass spectrometry (MS) data has been widely analyzed for the detection of early stage cancers. Its potential for seeking proteomic biomarkers has received a great deal of attention in recent years. In the sparse representation classification (SRC) framework, a testing sample is represented as a sparse linear combination of training samples. The coefficient vector of representation is obtained by a ℓ1-norm regularized least square method. Classification results are achieved by defining discriminant functions from the coefficient vector for each category. In this paper, a novel feature selection method based on SRC was proposed. To investigate its performance, the proposed methods was tested and evaluated on the ovarian cancer database OC-WCX2a and OC-WCX2b. The experimental results showed that SRC is efficient for tumor classification. Feature selection based on sparse representation (SRFS) can select highly predictive representative feature sets.
Keywords :
bioinformatics; cancer; data structures; feature extraction; mass spectra; patient diagnosis; pattern classification; proteomics; tumours; OC-WCX2a; OC-WCX2b; early stage cancer detection; feature selection; l1-norm regularized least square method; mass spectrometry data; ovarian cancer database; proteomic biomarkers; sparse linear combination; sparse representation classification; training samples; tumor classification; feature selection; protein mass spectrum; sparse representation;
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2010 IEEE International Conference on
Conference_Location :
Hong, Kong
Print_ISBN :
978-1-4244-8303-7
Electronic_ISBN :
978-1-4244-8304-4
DOI :
10.1109/BIBMW.2010.5703773