• DocumentCode
    496345
  • Title

    Data Pre-processing for More Effective Gene Clustering

  • Author

    Hou, Jingyu ; Chen, Yi-Ping Phoebe

  • Author_Institution
    Sch. of Eng. & Inf. Technol., Deakin Univ., Burwood, VIC, Australia
  • Volume
    1
  • fYear
    2009
  • fDate
    24-26 April 2009
  • Firstpage
    710
  • Lastpage
    713
  • Abstract
    The high-throughput experimental data from the new gene microarray technology has spurred numerous efforts to find effective ways of processing microarray data for revealing real biological relationships among genes. This work proposes an innovative data pre-processing approach to identify noise data in the data sets and eliminate or reduce the impact of the noise data on gene clustering, With the proposed algorithm, the pre-processed data sets make the clustering results stable across clustering algorithms with different similarity metrics, the important information of genes and features is kept, and the clustering quality is improved. The primary evaluation on real microarray data sets has shown the effectiveness of the proposed algorithm.
  • Keywords
    biology computing; genetics; pattern clustering; biological relationship; clustering quality; data preprocessing; gene clustering; gene microarray technology; noise data identification; Australia; Biology; Clustering algorithms; Data engineering; Filters; Gene expression; Information technology; Noise measurement; Noise reduction; Road transportation; Bioinformatics; clustering; data pre-processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Sciences and Optimization, 2009. CSO 2009. International Joint Conference on
  • Conference_Location
    Sanya, Hainan
  • Print_ISBN
    978-0-7695-3605-7
  • Type

    conf

  • DOI
    10.1109/CSO.2009.328
  • Filename
    5193792