DocumentCode :
534461
Title :
An unsupervised phenotypes and informative genes detection model with outlier consideration
Author :
Li, Yuan ; Zhao, Yuhai ; Wang, Guoren ; Wang, Zhanghui
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ., NEU, Shenyang, China
Volume :
6
fYear :
2010
fDate :
16-18 Oct. 2010
Firstpage :
2280
Lastpage :
2284
Abstract :
The DNA microarray technology enables rapid, large scale screening for patterns of gene expression. It is meaningful to detect useful phenotypes and the informative genes that can manifest these phenotypes in gene expression data. While the existing methods of phenotypes discriminating are most supervised methods, they train samples based on the known informative genes. In this paper, we propose an unsupervised phenotypes and informative genes detection model with outlier consideration called UPID, which can simultaneously mining phenotypes and informative genes from gene expression data. By adopting incremental computing optimization strategies, the calculation of UPID is greatly reduced. Furthermore, UPID decreases the impact of outliers by taking the sample proportion of each group into consideration, which makes the model more robust. Compared with HS, a previous pattern detection method for gene expression data, it shows that the algorithm we proposed, UPID is more efficient. Moreover, the experiments conducted on several real microarray datasets prove the effectiveness of the UPID algorithm.
Keywords :
DNA; bioinformatics; data mining; genetics; molecular biophysics; molecular configurations; optimisation; DNA microarray technology; UPID algorithm; gene expression data; gene expression patterns; incremental computing optimization strategies; informative gene mining; outlier consideration; phenotype discrimination; phenotype mining; unsupervised informative gene detection model; unsupervised phenotype model; Algorithm design and analysis; Data mining; Gene expression; Heuristic algorithms; Noise; Optimization; Partitioning algorithms; Bioinformatics; Data mining; Informative genes; Microarray; Phenotype; gene expression data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biomedical Engineering and Informatics (BMEI), 2010 3rd International Conference on
Conference_Location :
Yantai
Print_ISBN :
978-1-4244-6495-1
Type :
conf
DOI :
10.1109/BMEI.2010.5639328
Filename :
5639328
Link To Document :
بازگشت