• DocumentCode
    3477718
  • Title

    Feature Extraction in Spatially-Conserved Regions and Protein Functional Classification

  • Author

    Lee, Bum Ju ; Lee, Heon Gyu ; Kim, Dae-sung ; Ryu, Keun Ho

  • Author_Institution
    Database/Bioinf. Lab., Chungbuk Nat. Univ., Cheongju
  • fYear
    2007
  • fDate
    11-13 Oct. 2007
  • Firstpage
    165
  • Lastpage
    170
  • Abstract
    One of the most challenging problems in bioinformatics is prediction of protein functions and structures in unknown protein sequences. The sequence similarity-based approach is the most effective method for the prediction of protein function, but the approach often fails to identify the relevant proteins when similarity does not exist or exists at very low levels. Therefore, it is important to develop prediction and classification methods of protein function without sequence similarity. Our aim is to suggest protein function classification using protein properties without sequence similarity. In this paper, we propose feature extraction in spatially-conserved region sequences and apply high-ranked features through the selection of attributes for the classification of protein function. The experimental results demonstrate that RMSE and MAE rates decrease after low-ranked attributes are discarded from our classification. Our method points out classification using only important short sequences such as motif or conserved regions.
  • Keywords
    biology computing; feature extraction; molecular biophysics; proteins; sequences; MAE; RMSE; bioinformatics; feature extraction; protein functional classification; protein sequences; protein structures; sequence similarity-based approach; Amino acids; Biochemistry; Bioinformatics; Data mining; Feature extraction; Genetic mutations; Hidden Markov models; Protein engineering; Spatial databases; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in the Convergence of Bioscience and Information Technologies, 2007. FBIT 2007
  • Conference_Location
    Jeju City
  • Print_ISBN
    978-0-7695-2999-8
  • Type

    conf

  • DOI
    10.1109/FBIT.2007.51
  • Filename
    4524098