DocumentCode
1932890
Title
Feature Extraction from Protein Sequences and Classification of Enzyme Function
Author
Lee, Bum Ju ; Ryu, Keun Ho
Author_Institution
Sch. of Electr. & Comput. Eng., Chungbuk Nat. Univ., Chungbuk
Volume
1
fYear
2008
fDate
27-30 May 2008
Firstpage
138
Lastpage
142
Abstract
Enzymes are biological catalysts that mediate almost all chemical reactions and are found in all tissues and fluids of the body. These enzymes play a central role in metabolic pathways, and in the prediction of metabolic pathways. Our goals in the present study were to identify new features for reliable enzyme functional classification and prediction that do not rely on sequence alignment, and to improve the accuracy or lower the error rate using the attribute selection method. In this study, we designed novel features, including PPR, NNR, PNPR, PPRDist(x, y), NNRDist(x, y), and PNPRDist(x, y), extracted from each protein sequence. Using only protein sequences, we compiled a set of 84 attributes that characterize proteins, and obtained accuracy of 72.13% through identification of optimal attributes in a given dataset. Our experiment results demonstrate that these attributes, as novel features, are useful for enzyme functional classification. In addition, we identify and analyze the biologically meaningful features of a given dataset.
Keywords
biochemistry; catalysts; enzymes; feature extraction; molecular biophysics; pattern classification; NNRDist(x, y); PNPRDist(x, y); PPRDist(x, y); biological catalysts; enzyme functional classification; feature extraction; metabolic pathways; protein sequences; sequence alignment; Amino acids; Biochemistry; Biomedical engineering; Discrete cosine transforms; Feature extraction; Protein engineering; Protein sequence; Solvents; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
BioMedical Engineering and Informatics, 2008. BMEI 2008. International Conference on
Conference_Location
Sanya
Print_ISBN
978-0-7695-3118-2
Type
conf
DOI
10.1109/BMEI.2008.341
Filename
4548651
Link To Document