Title :
MANTIS: A Data Mining Methodology for Effective Translation Initiation Site Prediction
Author :
Tzanis, G. ; Berberidis, C. ; Vlahavas, I.
Author_Institution :
Aristotle Univ. of Thessaloniki, Thessaloniki
Abstract :
Predicting the translation initiation site in a genomic sequence with the highest possible accuracy remains an important open problem. Current approaches perform quite well; however, there is still room for a more general framework that gives researchers an effective and reliable methodology to follow. We developed a prediction methodology that combines ad hoc and discovered knowledge in order to increase accuracy significantly and reliably. The methodology is modular and consists of three major decision components: a consensus component, a coding region classification component, and a novel ATG location-based component that exploits the advantages of the popular ribosome scanning model while overcoming its limitations. The three components are combined by stacked generalization into a meta-classification system, yielding a highly effective prediction framework. We performed extensive comparative experiments on four different datasets, showing that the increase in accuracy and adjusted accuracy is not only statistically significant but also the highest reported.
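As a rough illustration of the idea behind the ATG location-based component, the sketch below enumerates candidate ATG codons and applies the plain ribosome scanning rule, which predicts the first ATG from the 5' end. It is not the authors' implementation: the names `AtgCandidate`, `find_atg_candidates`, and `scanning_model_prediction`, and the choice of "number of upstream ATGs" as a location feature, are assumptions made for this example.

```python
from typing import List, NamedTuple


class AtgCandidate(NamedTuple):
    """Hypothetical record for one candidate start codon."""
    position: int       # 0-based index of the 'A' in the sequence
    upstream_atgs: int  # number of ATGs found 5' of this one (assumed feature)


def find_atg_candidates(sequence: str) -> List[AtgCandidate]:
    """Return every ATG in the sequence, ordered from the 5' end."""
    seq = sequence.upper()
    candidates: List[AtgCandidate] = []
    start = seq.find("ATG")
    while start != -1:
        # len(candidates) equals the count of ATGs already seen upstream.
        candidates.append(AtgCandidate(start, len(candidates)))
        start = seq.find("ATG", start + 1)
    return candidates


def scanning_model_prediction(sequence: str) -> int:
    """Scanning-rule baseline: predict the first ATG, or -1 if none exists."""
    candidates = find_atg_candidates(sequence)
    return candidates[0].position if candidates else -1
```

The first-ATG rule is only a baseline, since the true initiation site is not always the first ATG. A location feature such as `upstream_atgs` could instead be passed, together with the outputs of the other components, to a meta-classifier that learns how much weight the ATG's position deserves.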
Keywords :
biology computing; data mining; genetics; MANTIS; decision components; genomic sequence; metaclassification system; ribosome scanning model; translation initiation site prediction; Accuracy; Artificial neural networks; Assembly systems; Bioinformatics; Biological system modeling; Context modeling; Genomics; Predictive models; Support vector machines; Animals; Database Management Systems; Forecasting; Humans; Peptide Chain Initiation, Translational
Conference_Title :
Engineering in Medicine and Biology Society, 2007. EMBS 2007. 29th Annual International Conference of the IEEE
Conference_Location :
Lyon
Print_ISBN :
978-1-4244-0787-3
DOI :
10.1109/IEMBS.2007.4353806