DocumentCode
104592
Title
Gene Selection Integrated with Biological Knowledge for Plant Stress Response Using Neighborhood System and Rough Set Theory
Author
Jun Meng ; Jing Zhang ; Yushi Luan
Author_Institution
Coll. of Comput. Sci. & Technol., Dalian Univ. of Technol., Dalian, China
Volume
12
Issue
2
fYear
2015
fDate
March-April 2015
Firstpage
433
Lastpage
444
Abstract
Mining knowledge from gene expression data is a hot research topic and direction of bioinformatics. Gene selection and sample classification are significant research trends, due to the large amount of genes and small size of samples in gene expression data. Rough set theory has been successfully applied to gene selection, as it can select attributes without redundancy. To improve the interpretability of the selected genes, some researchers introduced biological knowledge. In this paper, we first employ neighborhood system to deal directly with the new information table formed by integrating gene expression data with biological knowledge, which can simultaneously present the information in multiple perspectives and do not weaken the information of individual gene for selection and classification. Then, we give a novel framework for gene selection and propose a significant gene selection method based on this framework by employing reduction algorithm in rough set theory. The proposed method is applied to the analysis of plant stress response. Experimental results on three data sets show that the proposed method is effective, as it can select significant gene subsets without redundancy and achieve high classification accuracy. Biological analysis for the results shows that the interpretability is well.
Keywords
bioinformatics; botany; data mining; genetics; genomics; rough set theory; bioinformatics; biological analysis; biological knowledge; classification accuracy; gene classification; gene expression data; gene interpretability; gene selection method; gene subsets; information table; knowledge mining; neighborhood system; plant stress response; reduction algorithm; rough set theory; sample classification; Bioinformatics; Bismuth; Classification algorithms; Gene expression; Set theory; Stress; Rough set; biological knowledge; gene expression data; gene selection; neighborhood system;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2014.2361329
Filename
6920020
Link To Document