DocumentCode
830132
Title
Dimension reduction-based penalized logistic regression for cancer classification using microarray data
Author
Shen, Li ; Tan, Eng Chong
Author_Institution
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
Volume
2
Issue
2
fYear
2005
Firstpage
166
Lastpage
175
Abstract
The use of penalized logistic regression for cancer classification using microarray expression data is presented. Two dimension reduction methods are respectively combined with the penalized logistic regression so that both the classification accuracy and computational speed are enhanced. Two other machine-learning methods, support vector machines and least-squares regression, have been chosen for comparison. It is shown that our methods have achieved at least equal or better results. They also have the advantage that the output probability can be explicitly given and the regression coefficients are easier to interpret. Several other aspects, such as the selection of penalty parameters and components, pertinent to the application of our methods for cancer classification are also discussed.
Keywords
arrays; cancer; learning (artificial intelligence); least squares approximations; logistics data processing; medical diagnostic computing; patient diagnosis; regression analysis; support vector machines; cancer classification; classification accuracy; computational speed; dimension reduction-based penalized logistic regression; least-squares regression; machine learning; microarray expression data; output probability; support vector machines; Bioinformatics; Cancer; Gene expression; Least squares methods; Logistics; Neoplasms; Singular value decomposition; Supervised learning; Support vector machine classification; Support vector machines; Dimension reduction; cancer classification; classifier design and evaluation; feature evaluation and selection; microarray data.; partial least squares; penalized logistic regression; singular value decomposition; Algorithms; Diagnosis, Computer-Assisted; Gene Expression Profiling; Humans; Logistic Models; Models, Genetic; Neoplasm Proteins; Neoplasms; Oligonucleotide Array Sequence Analysis; Regression Analysis; Reproducibility of Results; Sensitivity and Specificity; Tumor Markers, Biological;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2005.22
Filename
1438353
Link To Document