DocumentCode
3083727
Title
Research on the Application of Pattern Selection Algorithm in Bioinformatic Data Bases on Mutual Information
Author
Li, Xin ; Hong, Wenxue ; Zhao, Chun
Author_Institution
Inst. of Biomed. Eng., Yanshan Univ., Qinhuangdao, China
fYear
2010
fDate
17-19 Sept. 2010
Firstpage
1022
Lastpage
1025
Abstract
Pattern selection is an important part in the research fields of data mining and pattern recognition, especially for the high-dimensional data in the Bioinformatics. In this paper, a new pattern selection algorithm was proposed to finish pattern selection bases on Mutual Information. Pattern subset evaluation index was researched to ensure the best feature subset was selected. The algorithm bases on the correlation of patterns and label, as well as the redundancy between the patterns. Fuzzy Pattern Subset Evaluation Index was researched to make sure which is the best subset for the pattern subset evaluation. To verify the effectiveness of the method, some experiments were finished with the data of gene expression data (Leiden University) and UCI datasets. The experimental results indicate that the algorithm achieved better results.
Keywords
bioinformatics; data mining; database management systems; pattern classification; set theory; Leiden University; UCI dataset; bioinformatic database; data mining; fuzzy pattern subset evaluation index; gene expression data; mutual information; pattern recognition; pattern selection algorithm; Bioinformatics; Classification algorithms; Gene expression; Indexes; Mutual information; Pattern recognition; Redundancy; Bioinformatics; Feature Selection; Mutual Information; Pattern Selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Pervasive Computing Signal Processing and Applications (PCSPA), 2010 First International Conference on
Conference_Location
Harbin
Print_ISBN
978-1-4244-8043-2
Electronic_ISBN
978-0-7695-4180-8
Type
conf
DOI
10.1109/PCSPA.2010.252
Filename
5635668
Link To Document