• DocumentCode
    1362050
  • Title

    Microarray Data Classifier Consisting of k-Top-Scoring Rank-Comparison Decision Rules With a Variable Number of Genes

  • Author

    Yoon, Youngmi ; Bien, Sangjay ; Park, Sanghyun

  • Author_Institution
    Dept. of Inf. Technol., Gachon Univ. of Med. & Sci., Incheon, South Korea
  • Volume
    40
  • Issue
    2
  • fYear
    2010
  • fDate
    3/1/2010 12:00:00 AM
  • Firstpage
    216
  • Lastpage
    226
  • Abstract
    Microarray experiments generate quantitative expression measurements for thousands of genes simultaneously, which is useful for phenotype classification of many diseases. Our proposed phenotype classifier is an ensemble method with k-top-scoring decision rules. Each rule involves a number of genes, a rank comparison relation among them, and a class label. Current classifiers, which are also ensemble methods, consist of k-top-scoring decision rules. Some of these classifiers fix the number of genes in each rule as a triple or a pair. In this paper, we generalize the number of genes involved in each rule. The number of genes in each rule ranges from 2 to N, respectively. Generalizing the number of genes increases the robustness and the reliability of the classifier for the class prediction of an independent sample. Our algorithm saves resources by combining shorter rules in order to build a longer rule. It converges rapidly toward its high-scoring rule list by implementing several heuristics. The parameter k is determined by applying leave-one-out cross validation to the training dataset.
  • Keywords
    DNA; biology computing; data analysis; data mining; diseases; pattern classification; diseases; k-top-scoring rank-comparison decision rules; knowledge-based data mining; microarray data analysis; microarray data classifier; phenotype classification; quantitative expression measurements; Data mining; knowledge-based data mining; microarray data analysis; microarray data classification;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1094-6977
  • Type

    jour

  • DOI
    10.1109/TSMCC.2009.2036594
  • Filename
    5357431