Title :
Using Emerging Subsequence in Classifying Protein Structural Class
Author :
Saeed, Khalid E K ; Lee, Heon Gyu ; Kim, Wun-Jae ; Cha, Eun-Jong ; Ryu, Keun Ho
Author_Institution :
Database/Bioinf. Lab., Chungbuk Nat. Univ., Cheongju, South Korea
Abstract :
Knowledge about protein´s structure can help in understanding its function and has many applications in computer-aided drug design and protein engineering. In this paper we introduce a new methodology for predicting protein structural class using Emerging Subsequences (ES). In a sequence database, an emerging subsequence of data class is a subsequence which occurs more frequently in that class rather than other classes. They can capture significant contrast between data classes. Our idea is to discover all the ES from protein sequence database and use as representatives for this data. Our experimental results using CATH database shows good result when evaluating the accuracy of the proposed method.
Keywords :
biology computing; data mining; database management systems; pattern classification; CATH database; drug design; emerging subsequences method; protein engineering; protein sequence database; protein structural class classification; protein structure; Amino acids; Application software; Bioinformatics; Computer applications; Databases; Drugs; Fuzzy systems; Laboratories; Protein engineering; Protein sequence;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.752