DocumentCode :
617999
Title :
Woven string kernels for DNA sequence classification
Author :
McEachern, Andrew ; Ashlock, Daniel
Author_Institution :
Dept. of Math. & Stat., Univ. of Guelph, Guelph, ON, Canada
fYear :
2013
fDate :
20-23 June 2013
Firstpage :
1578
Lastpage :
1585
Abstract :
Woven string kernels are a form of evolvable directed acyclic graph specialized to perform DNA classification. They are introduced in this study and tested on simple and complex synthetic data as well as biological data. The WSKs perform marginally on the simplest synthetic data - based on GC content - for which they are not entirely appropriate. They exhibit perfect classification on the more complex synthetic data and on the biological data. Woven string kernels have a number of parameters including their height, the number of initial strings from which they are built, and the amount of “weaving” used to generate the final structure. A parameter study shows that these parameters must be set based on the type of data under analysis. The paper concludes with comments on possible improvements of the woven string kernel technique.
Keywords :
DNA; biology computing; directed graphs; pattern classification; DNA sequence classification; GC content; WSK; biological data; complex synthetic data; evolvable directed acyclic graph; height parameter; initial string number; simple synthetic data; weaving amount; woven string kernel technique; Automata; Correlation; DNA; Kernel; Standards; Vegetation; Weaving;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2013 IEEE Congress on
Conference_Location :
Cancun
Print_ISBN :
978-1-4799-0453-2
Electronic_ISBN :
978-1-4799-0452-5
Type :
conf
DOI :
10.1109/CEC.2013.6557750
Filename :
6557750
Link To Document :
بازگشت