DocumentCode :
2484135
Title :
Sparse representation via ℓ1-minimization for underdetermined systems in classification of tumors with gene expression data
Author :
Sánchez, R. ; Argáez, M. ; Guillén, P.
Author_Institution :
Univ. of Texas at El Paso, El Paso, TX, USA
fYear :
2011
fDate :
Aug. 30 2011-Sept. 3 2011
Firstpage :
3362
Lastpage :
3366
Abstract :
The development of cancer diagnosis models and cancer discovery from DNA microarray data are of great interest in bioinformatics and medicine. In pattern recognition and machine learning, a classification problem refers to finding an algorithm that assigns a given input to one of several categories. Many natural signals are sparse or compressible in the sense that they have short representations when expressed in a suitable basis. Motivated by recent successful algorithm developments for sparse signal recovery, we apply the selective nature of sparse representation to perform this classification. To find such a sparse representation, we implement an ℓ1-minimization algorithm. This methodology overcomes the lack of robustness with respect to outliers. In contrast to other classification algorithms, no model selection dependency is involved. The minimization algorithm is a convex-relaxation-type method that has been proven to recover sparse signals efficiently. To study its performance, the proposed method is applied to six tumor gene expression datasets and numerically compared with various support vector machine (SVM) methods. The numerical results show that the proposed ℓ1-minimization algorithm performs at least comparably to, and often better than, SVMs.
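The classification idea in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the ℓ1 problem is solved approximately with iterative soft-thresholding (ISTA) on an ℓ1-regularized least-squares objective rather than with the authors' algorithm, the label is chosen by the smallest class-wise reconstruction residual (a rule common in sparse-representation classification), and the names `ista_l1`, `classify`, `A_train`, and `labels` as well as the toy numbers are hypothetical.

```python
import math


def soft_threshold(v, t):
    """Shrink v toward zero by t (proximal map of t * |v|)."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def ista_l1(A, y, lam=0.01, n_iter=500):
    """Approximately minimize 0.5 * ||A x - y||^2 + lam * ||x||_1 with ISTA.

    A is a list of m rows (genes), each with n entries (training samples).
    This is a stand-in solver, not the algorithm used in the paper.
    """
    m, n = len(A), len(A[0])
    # The squared Frobenius norm is an upper bound on the gradient's
    # Lipschitz constant, so 1 / L is a safe step size.
    L = sum(a * a for row in A for a in row)
    x = [0.0] * n
    for _ in range(n_iter):
        r = [sum(A[i][j] * x[j] for j in range(n)) - y[i] for i in range(m)]
        g = [sum(A[i][j] * r[i] for i in range(m)) for j in range(n)]
        x = [soft_threshold(x[j] - g[j] / L, lam / L) for j in range(n)]
    return x


def classify(A, labels, y, lam=0.01):
    """Return the class whose training columns best reconstruct y."""
    x = ista_l1(A, y, lam)
    m, n = len(A), len(A[0])
    best_label, best_res = None, math.inf
    for c in sorted(set(labels)):
        # Keep only the coefficients that belong to class c.
        approx = [
            sum(A[i][j] * x[j] for j in range(n) if labels[j] == c)
            for i in range(m)
        ]
        res = math.sqrt(sum((y[i] - approx[i]) ** 2 for i in range(m)))
        if res < best_res:
            best_label, best_res = c, res
    return best_label


# Made-up toy data: 3 "genes" (rows) by 4 training samples (columns),
# so the system A x = y is underdetermined.
A_train = [
    [1.0, 0.9, 0.0, 0.0],
    [0.0, 0.1, 1.0, 0.9],
    [0.0, 0.0, 0.0, 0.4],
]
labels = [0, 0, 1, 1]  # class of each training column

print(classify(A_train, labels, [1.0, 0.05, 0.0]))
print(classify(A_train, labels, [0.0, 1.0, 0.2]))
```

The sketch uses plain Python lists to stay dependency-free; for real microarray data, with thousands of genes per sample, a vectorized solver would be the practical choice.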
Keywords :
bioinformatics; cancer; genetics; lab-on-a-chip; learning (artificial intelligence); medical signal processing; minimisation; patient diagnosis; pattern recognition; signal classification; support vector machines; tumours; ℓ1-minimization; DNA microarray data; SVM; bioinformatics; cancer diagnosis model; cancer discovery; classification problem; convex relaxation-like model; machine learning; medicine; minimization algorithm; pattern recognition; sparse representation; sparse signal recovery; support vector machine method; tumor classification; tumor gene expression datasets; underdetermined system; Cancer; Classification algorithms; Gene expression; Support vector machines; Training; Tumors; Vectors; Algorithms; Gene Expression; Humans; Neoplasms; Support Vector Machines;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE
Conference_Location :
Boston, MA
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4121-1
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/IEMBS.2011.6090911
Filename :
6090911