DocumentCode :
104592
Title :
Gene Selection Integrated with Biological Knowledge for Plant Stress Response Using Neighborhood System and Rough Set Theory
Author :
Jun Meng ; Jing Zhang ; Yushi Luan
Author_Institution :
Coll. of Comput. Sci. & Technol., Dalian Univ. of Technol., Dalian, China
Volume :
12
Issue :
2
fYear :
2015
fDate :
March-April 2015
Firstpage :
433
Lastpage :
444
Abstract :
Mining knowledge from gene expression data is a hot research topic and direction of bioinformatics. Gene selection and sample classification are significant research trends, due to the large amount of genes and small size of samples in gene expression data. Rough set theory has been successfully applied to gene selection, as it can select attributes without redundancy. To improve the interpretability of the selected genes, some researchers introduced biological knowledge. In this paper, we first employ neighborhood system to deal directly with the new information table formed by integrating gene expression data with biological knowledge, which can simultaneously present the information in multiple perspectives and do not weaken the information of individual gene for selection and classification. Then, we give a novel framework for gene selection and propose a significant gene selection method based on this framework by employing reduction algorithm in rough set theory. The proposed method is applied to the analysis of plant stress response. Experimental results on three data sets show that the proposed method is effective, as it can select significant gene subsets without redundancy and achieve high classification accuracy. Biological analysis for the results shows that the interpretability is well.
Keywords :
bioinformatics; botany; data mining; genetics; genomics; rough set theory; bioinformatics; biological analysis; biological knowledge; classification accuracy; gene classification; gene expression data; gene interpretability; gene selection method; gene subsets; information table; knowledge mining; neighborhood system; plant stress response; reduction algorithm; rough set theory; sample classification; Bioinformatics; Bismuth; Classification algorithms; Gene expression; Set theory; Stress; Rough set; biological knowledge; gene expression data; gene selection; neighborhood system;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2014.2361329
Filename :
6920020
Link To Document :
بازگشت