Title :
Research on the Application of Pattern Selection Algorithm in Bioinformatic Data Bases on Mutual Information
Author :
Li, Xin ; Hong, Wenxue ; Zhao, Chun
Author_Institution :
Inst. of Biomed. Eng., Yanshan Univ., Qinhuangdao, China
Abstract :
Pattern selection is an important part in the research fields of data mining and pattern recognition, especially for the high-dimensional data in the Bioinformatics. In this paper, a new pattern selection algorithm was proposed to finish pattern selection bases on Mutual Information. Pattern subset evaluation index was researched to ensure the best feature subset was selected. The algorithm bases on the correlation of patterns and label, as well as the redundancy between the patterns. Fuzzy Pattern Subset Evaluation Index was researched to make sure which is the best subset for the pattern subset evaluation. To verify the effectiveness of the method, some experiments were finished with the data of gene expression data (Leiden University) and UCI datasets. The experimental results indicate that the algorithm achieved better results.
Keywords :
bioinformatics; data mining; database management systems; pattern classification; set theory; Leiden University; UCI dataset; bioinformatic database; data mining; fuzzy pattern subset evaluation index; gene expression data; mutual information; pattern recognition; pattern selection algorithm; Bioinformatics; Classification algorithms; Gene expression; Indexes; Mutual information; Pattern recognition; Redundancy; Bioinformatics; Feature Selection; Mutual Information; Pattern Selection;
Conference_Titel :
Pervasive Computing Signal Processing and Applications (PCSPA), 2010 First International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-8043-2
Electronic_ISBN :
978-0-7695-4180-8
DOI :
10.1109/PCSPA.2010.252