Title :
Improving splice-junctions classification employing a novel encoding schema and decision-tree
Author :
Salekdeh, Amin Yazdani ; Wiese, Kay C.
Author_Institution :
Sch. of Comput. Sci., Simon Fraser Univ., Vancouver, BC, Canada
Abstract :
Splice junctions are important regions in genes, which have been studied in many studies in Genetics. Recently some attempts in computer science have been made to use computer power in distinguishing the different splice junctions and non-junctions regions in genes. Ambiguity in the measurements of nucleotides is an important issue in dealing with these regions. In this paper a novel method is proposed along with an encoding schema which take ambiguities into account using probabilistic intuitions. The method is based on Decision Trees, using K Nearest Negihbours and Support Vector Machines. The results have shown the significance of using the proposed encoding schema and classification method.
Keywords :
biology computing; decision trees; encoding; genetics; pattern classification; support vector machines; K-nearest neighbour; decision tree; encoding; genetic; nucleotide; probabilistic intuition; splice-junction classification; support vector machine; Accuracy; DNA; Decision trees; Encoding; Hamming distance; Junctions; Support vector machines;
Conference_Titel :
Evolutionary Computation (CEC), 2011 IEEE Congress on
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-7834-7
DOI :
10.1109/CEC.2011.5949766