• DocumentCode
    1989697
  • Title

    Mining Conserved Structures of Enzymes from Functional Hierarchical Classification

  • Author

    Huang, Yu-Feng ; Lin, Yu-Shin ; Hsu, Tian-Wei ; Huang, Chien-Kang

  • Author_Institution
    Nat. Taiwan Univ., Taipei
  • fYear
    2007
  • fDate
    14-17 Oct. 2007
  • Firstpage
    418
  • Lastpage
    424
  • Abstract
    Sequence conservation related to protein function has been discovered via protein sequence alignment and pattern mining. In contrast, our motivation is to mine structure conservation via frequent itemset mining from the viewpoint of structure. In order to describe local structure, neighborhood residue sphere (NRS) is proposed, which is a sphere with 10 A radius of each residue with the combination of sequence and spatial information. Currently, we obtain 56,164 NRSs among 456 EC labels of local conserved region out of total 646 EC labels. In EC label prediction, our experimental results reveal 80.61% Confidence and 53% Accuracy while selecting 1,000 proteins with sequence identity less than 60% from 13,373 enzymes among 563 EC labels. Due to the coverage rate is around 80% higher than CSA and Protemot, the Confidence is almost doubled in comparing with CSA and Protemot. In this study, we choose alternative to figure out function-related local structure without using protein binding site information of protein-ligand complexes.
  • Keywords
    biology computing; data mining; enzymes; Protemot; enzymes; mining conserved structures; neighborhood residue sphere; proteins; Amino acids; Biochemistry; Biology; Computer science; Data mining; Information analysis; Itemsets; Oceans; Protein engineering; Protein sequence; Local structure conservation; enzyme classification prediction; neighborhood residues sphere; protein structure mining; structure conservation mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-1509-0
  • Type

    conf

  • DOI
    10.1109/BIBE.2007.4375596
  • Filename
    4375596