• DocumentCode
    464314
  • Title

    Understanding the Prediction of Transmembrane Proteins by Support Vector Machine using Association Rule Mining

  • Author

    Hu, Hae-Jin ; Wang, Hao ; Harrison, Robert ; Tai, Phang C. ; Pan, Yi

  • Author_Institution
    Molecular Basis of Disease Program, Georgia State Univ., Atlanta, GA
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    418
  • Lastpage
    425
  • Abstract
    With the efforts to understand protein structure, many computational approaches have been made recently. Among them, the support vector machine (SVM) methods have been recently applied and showed successful performance compared with other machine learning schemes. However, despite the high performance, the SVM approaches suffer from the problem of understandability since it is a black-box model. To overcome this limitation, this study attempted to combine the SVM with the association rule based classifier which can present the meaningful explanation about the prediction. To perform this task, a new association rule based classifier (PCPAR) was devised based on the existing classifier, CPAR, to handle the sequential data. PCPAR creates the patterns by merging the generated rules and then classifies the sequential data based on the pattern match. The experimental result presents the following: with sequential data, the PCPAR scheme shows better performance with respect to the accuracy and the number of generated patterns than CPAR method whether applied alone or combined with SVM. The combined scheme of SVMPCPAR generates more compact patterns than the combined scheme of SVM with decision tree, SVM DT, with similar performance. These patterns are easily understandable and biologically meaningful
  • Keywords
    biology computing; data mining; pattern classification; proteins; support vector machines; association rule based classifier; association rule mining; machine learning; support vector machine; transmembrane proteins; Association rules; Biology computing; Computer science; Data mining; Decision trees; Machine learning; Proteins; Prototypes; Support vector machine classification; Support vector machines; CPAR; PCPAR; association rule based classifier; decision tree; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0710-9
  • Type

    conf

  • DOI
    10.1109/CIBCB.2007.4221252
  • Filename
    4221252