DocumentCode
496345
Title
Data Pre-processing for More Effective Gene Clustering
Author
Hou, Jingyu ; Chen, Yi-Ping Phoebe
Author_Institution
Sch. of Eng. & Inf. Technol., Deakin Univ., Burwood, VIC, Australia
Volume
1
fYear
2009
fDate
24-26 April 2009
Firstpage
710
Lastpage
713
Abstract
The high-throughput experimental data from the new gene microarray technology has spurred numerous efforts to find effective ways of processing microarray data for revealing real biological relationships among genes. This work proposes an innovative data pre-processing approach to identify noise data in the data sets and eliminate or reduce the impact of the noise data on gene clustering, With the proposed algorithm, the pre-processed data sets make the clustering results stable across clustering algorithms with different similarity metrics, the important information of genes and features is kept, and the clustering quality is improved. The primary evaluation on real microarray data sets has shown the effectiveness of the proposed algorithm.
Keywords
biology computing; genetics; pattern clustering; biological relationship; clustering quality; data preprocessing; gene clustering; gene microarray technology; noise data identification; Australia; Biology; Clustering algorithms; Data engineering; Filters; Gene expression; Information technology; Noise measurement; Noise reduction; Road transportation; Bioinformatics; clustering; data pre-processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Sciences and Optimization, 2009. CSO 2009. International Joint Conference on
Conference_Location
Sanya, Hainan
Print_ISBN
978-0-7695-3605-7
Type
conf
DOI
10.1109/CSO.2009.328
Filename
5193792
Link To Document