Title :
Mining Conserved Structures of Enzymes from Functional Hierarchical Classification
Author :
Huang, Yu-Feng ; Lin, Yu-Shin ; Hsu, Tian-Wei ; Huang, Chien-Kang
Author_Institution :
Nat. Taiwan Univ., Taipei
Abstract :
Sequence conservation related to protein function has been discovered via protein sequence alignment and pattern mining. In contrast, our motivation is to mine structure conservation via frequent itemset mining from the viewpoint of structure. In order to describe local structure, neighborhood residue sphere (NRS) is proposed, which is a sphere with 10 A radius of each residue with the combination of sequence and spatial information. Currently, we obtain 56,164 NRSs among 456 EC labels of local conserved region out of total 646 EC labels. In EC label prediction, our experimental results reveal 80.61% Confidence and 53% Accuracy while selecting 1,000 proteins with sequence identity less than 60% from 13,373 enzymes among 563 EC labels. Due to the coverage rate is around 80% higher than CSA and Protemot, the Confidence is almost doubled in comparing with CSA and Protemot. In this study, we choose alternative to figure out function-related local structure without using protein binding site information of protein-ligand complexes.
Keywords :
biology computing; data mining; enzymes; Protemot; enzymes; mining conserved structures; neighborhood residue sphere; proteins; Amino acids; Biochemistry; Biology; Computer science; Data mining; Information analysis; Itemsets; Oceans; Protein engineering; Protein sequence; Local structure conservation; enzyme classification prediction; neighborhood residues sphere; protein structure mining; structure conservation mining;
Conference_Titel :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
DOI :
10.1109/BIBE.2007.4375596