• DocumentCode
    2007796
  • Title

    Microarray Classification from Several Two-Gene Expression Comparisons

  • Author

    German, Daniel ; Afsari, Bahman ; Tan, Aik Choon ; Naiman, Daniel Q.

  • fYear
    2008
  • fDate
    11-13 Dec. 2008
  • Firstpage
    583
  • Lastpage
    585
  • Abstract
    We describe our contribution to the ICMLA2008 ¿Automated Micro-Array Classification Challenge¿. The design of our classifier is motivated by the special scenario encountered in molecular cancer classification based on the mRNA concentrations provided by gene microarray data. Our classifier is rank-based; it only depends on expression comparisons among selected pairs of genes. Such comparisons are invariant to most of the transformations involved in preprocessing and normalization. Every pair of genes determines a binary classifier - choose the class for which the observed ordering is most likely. Pairs are scored by maximizing accuracy. In our k-TSP (k-disjoint Top Scoring Pairs) classifier, k disjoint pairs of genes are learned from training data; the discriminant function is simply the difference in the number of votes for the two classes. This rule involves exactly 2k genes, is readily interpretable, and provides some state-of-the-art results in cancer diagnosis and prognosis for small values of k, even k=1.
  • Keywords
    cancer; macromolecules; medical image processing; pattern classification; ICMLA2008; binary classifier; cancer diagnosis; cancer prognosis; k-disjoint top scoring classifier; mRNA concentrations; microarray classification; molecular cancer classification; two-gene expression; Biology computing; Cancer; DNA; Diseases; Gene expression; Machine learning; Probability distribution; Random variables; Training data; Voting; Molecular classification; cancer; gene expression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-0-7695-3495-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2008.152
  • Filename
    4725033