Title :
Strategy of finding optimal number of features on gene expression data
Author :
Sharma, Ashok ; Koh, C.H. ; Imoto, Seiya ; Miyano, S.
Author_Institution :
Human Genome Center, Univ. of Tokyo, Tokyo, Japan
Abstract :
Feature selection is an important step in the analysis of transcriptome or gene expression data. It mitigates the curse of dimensionality and makes the problem easier to interpret. Numerous feature selection methods have been proposed in the literature, and these methods rank genes in order of their relative importance. However, most of them determine the number of genes to use in an arbitrary or heuristic fashion. A theoretical way to determine the optimal number of genes to select for a given task is proposed. The proposed strategy has been applied to a number of gene expression datasets, and promising results have been obtained.
Keywords :
cancer; feature extraction; image classification; medical image processing; feature selection; gene expression datasets; heuristic fashion; optimal number; transcriptome analysis
Journal_Title :
Electronics Letters
DOI :
10.1049/el.2011.0526