• DocumentCode
    2917273
  • Title

    Feature Mining and Integration for Improving the Prediction Accuracy of Translation Initiation Sites in Eukaryotic mRNAs

  • Author

    Ma, Chuang ; Zhou, Dao ; Zhou, Yanhong

  • Author_Institution
    Hubei Bioinformatics & Molecular Imaging Key Lab., Huazhong Univ. of Sci. & Technol.,, Wuhan
  • fYear
    2006
  • fDate
    Oct. 2006
  • Firstpage
    349
  • Lastpage
    356
  • Abstract
    Accurate prediction of translation initiation sites (TISs) is important for the annotation of genomes. Although many methods have been proposed to solve this problem, the prediction accuracy is still limited. In this paper, the features that have been widely used for predicting TISs are further analyzed, and it is found that some features of TISs and non-TISs are heavily dependent on the C+G content of sequences around AUG codons, and some features are quite different for non-TISs located in untranslated regions and coding regions considering different reading frames. Further, the strategy of using multiple support vector machines to fully make use of the information is proposed, and a new program TISKey for the prediction of TISs is developed. Testing results on widely used dataset demonstrate that TISKey could get better prediction accuracy. TISKey can be accessed at http://infosci.hust.edu.cn
  • Keywords
    DNA; biology computing; genetics; prediction theory; sequences; support vector machines; AUG codons; TISKey; accurate prediction; content of sequences; eukaryotic mRNAs; feature mining; genomes; prediction accuracy; reading frames; support vector machines; translation initiation sites; Accuracy; Bioinformatics; Genomics; Laboratories; Machine learning; Machine learning algorithms; Molecular imaging; Neural networks; Support vector machines; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Grid and Cooperative Computing Workshops, 2006. GCCW '06. Fifth International Conference on
  • Conference_Location
    Hunan
  • Print_ISBN
    0-7695-2695-0
  • Type

    conf

  • DOI
    10.1109/GCCW.2006.40
  • Filename
    4031573