DocumentCode :
1535183
Title :
Fuzzy–Rough Sets for Information Measures and Selection of Relevant Genes From Microarray Data
Author :
Maji, Pradipta ; Pal, Sankar K.
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
Volume :
40
Issue :
3
fYear :
2010
fDate :
6/1/2010 12:00:00 AM
Firstpage :
741
Lastpage :
752
Abstract :
Several information measures such as entropy, mutual information, and f-information have been shown to be successful for selecting a set of relevant and nonredundant genes from a high-dimensional microarray data set. However, for continuous gene expression values, it is very difficult to find the true density functions and to perform the integrations required to compute different information measures. In this regard, the concept of the fuzzy equivalence partition matrix is presented to approximate the true marginal and joint distributions of continuous gene expression values. The fuzzy equivalence partition matrix is based on the theory of fuzzy-rough sets, where each row of the matrix represents a fuzzy equivalence partition that can automatically be derived from the given expression values. The performance of the proposed approach is compared with that of existing approaches using the class separability index and the predictive accuracy of the support vector machine. An important finding, however, is that the proposed approach is shown to be effective for selecting relevant and nonredundant continuous-valued genes from microarray data.
Keywords :
biology computing; fuzzy set theory; genetics; matrix algebra; rough set theory; support vector machines; continuous gene expression values; fuzzy equivalence partition matrix; fuzzy-rough sets; information measures; microarray data; nonredundant genes; relevant genes; support vector machine; Classification; gene selection; information measures; microarray analysis; rough sets; Algorithms; Artificial Intelligence; Computer Simulation; Decision Support Techniques; Fuzzy Logic; Gene Expression Profiling; Models, Theoretical; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2009.2028433
Filename :
5308229
Link To Document :
بازگشت