Title :
The TF-IDF and Neural Networks Approach for Translation Initiation Site Prediction
Author :
Kongmanee, Tarintorn ; Vanichayobon, Sirirut ; Wettayaprasit, Wiphada
Author_Institution :
Comput. Sci. Dept., Prince of Songkla Univ., Songkla, Thailand
Abstract :
The precise prediction of translation initiation site is an important task for the analysis of genomic sequence. This study aims to increase the accuracy for the prediction of translation initiation site using a TF-IDF-NN-TIS model (TF-IDF and neural networks approach for translation initiation site prediction). This study creates feature using 1-gram and 2-gram techniques for both upstream and downstream. Determining feature value uses TF-IDF approach and feature selection by correlation-based feature selection method. Evaluation prediction results use 10-fold cross validation. This study performed experiments on three different datasets that are Vertebrate, Arabidopsis thaliana, and TIS+50. The results of the study indicate that the proposed model gives highest accuracy with less processing time.
Keywords :
biology computing; correlation methods; neural nets; TF-IDF; TF-IDF-NN-TIS model; correlation-based feature selection method; genomic sequence analysis; neural networks; translation initiation site prediction; Accuracy; Artificial intelligence; Artificial neural networks; Bioinformatics; Computer science; DNA; Genomics; Laboratories; Neural networks; Sequences; TF-IDF; correlation-based feature selection; neural networks; translation initation sites;
Conference_Titel :
Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4519-6
Electronic_ISBN :
978-1-4244-4520-2
DOI :
10.1109/ICCSIT.2009.5234582