DocumentCode
464314
Title
Understanding the Prediction of Transmembrane Proteins by Support Vector Machine using Association Rule Mining
Author
Hu, Hae-Jin ; Wang, Hao ; Harrison, Robert ; Tai, Phang C. ; Pan, Yi
Author_Institution
Molecular Basis of Disease Program, Georgia State Univ., Atlanta, GA
fYear
2007
fDate
1-5 April 2007
Firstpage
418
Lastpage
425
Abstract
With the efforts to understand protein structure, many computational approaches have been made recently. Among them, the support vector machine (SVM) methods have been recently applied and showed successful performance compared with other machine learning schemes. However, despite the high performance, the SVM approaches suffer from the problem of understandability since it is a black-box model. To overcome this limitation, this study attempted to combine the SVM with the association rule based classifier which can present the meaningful explanation about the prediction. To perform this task, a new association rule based classifier (PCPAR) was devised based on the existing classifier, CPAR, to handle the sequential data. PCPAR creates the patterns by merging the generated rules and then classifies the sequential data based on the pattern match. The experimental result presents the following: with sequential data, the PCPAR scheme shows better performance with respect to the accuracy and the number of generated patterns than CPAR method whether applied alone or combined with SVM. The combined scheme of SVMPCPAR generates more compact patterns than the combined scheme of SVM with decision tree, SVM DT, with similar performance. These patterns are easily understandable and biologically meaningful
Keywords
biology computing; data mining; pattern classification; proteins; support vector machines; association rule based classifier; association rule mining; machine learning; support vector machine; transmembrane proteins; Association rules; Biology computing; Computer science; Data mining; Decision trees; Machine learning; Proteins; Prototypes; Support vector machine classification; Support vector machines; CPAR; PCPAR; association rule based classifier; decision tree; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
Conference_Location
Honolulu, HI
Print_ISBN
1-4244-0710-9
Type
conf
DOI
10.1109/CIBCB.2007.4221252
Filename
4221252
Link To Document