DocumentCode
1989586
Title
Super Granular Shrink-SVM Feature Elimination (Super GS-SVM-FE) Model for Protein Sequence Motif Information Extraction
Author
Chen, Bernard ; Pellicer, Stephen ; Tai, Phang C. ; Harrison, Robert ; Pan, Yi
Author_Institution
Georgia State Univ., Atlanta
fYear
2007
fDate
14-17 Oct. 2007
Firstpage
379
Lastpage
386
Abstract
Protein sequence motifs are gathering more and more attention in the sequence analysis area. These recurring regions have the potential to determine protein ´s conformation, function and activities. In our previous work, we tried to obtain protein sequence motifs which are universally conserved across protein family boundaries. Therefore, unlike most popular motif discovering algorithms, our input dataset is extremely large. In order to deal with large input datasets, we provided two granular computing models (FIK and FGK model) to efficiently generate protein motifs information and Super GSVM-FE model to do the feature elimination for improving the quality of motif information. In this article, we tried to further improve our SVM feature elimination model to achieve three goals: Reduce time execution by half, further improve motif information quality and add the ability of adjusting the number of filtered segments. Compared with the latest results, our new approach shows great improvements.
Keywords
biology computing; macromolecules; proteins; support vector machines; conformation; elimination model; filtered segments; fuzzy-Greedy-Kmeans model; granular computing models; information extraction; protein sequence motifs; super granular shrink-SVM feature elimination model; Biological system modeling; Biology; Clustering algorithms; Computer science; Data mining; Information filtering; Information filters; Protein sequence; Sequences; Space technology;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location
Boston, MA
Print_ISBN
978-1-4244-1509-0
Type
conf
DOI
10.1109/BIBE.2007.4375591
Filename
4375591
Link To Document