Title : 
Diagnostic character location within the cryptic skipper butterfly species complex with an evolutionary algorithm
         
        
            Author : 
Ashlock, Daniel ; Von Königslöw, Taika
         
        
            Author_Institution : 
Dept. of Math. & Stat., Univ. of Guelph, Guelph, ON
         
        
        
            fDate : 
March 30 2009-April 2 2009
         
        
        
        
            Abstract : 
This study presents an evolutionary algorithm for locating DNA sequence characters that are diagnostic between closely related groups of species. The algorithm is developed using synthetic data and then tested on biological data from a species of butterfly recently discovered to be a cryptic complex of species. This technique proved to be successful in locating positions that are diagnostic of the cryptic neotropical skipper butterfly species within the cytochrome c oxidase subunit I (COI) DNA barcode data. The algorithm uses a novel subset representation to select positions within the DNA sequences. A crossover operator that takes pairs of subsets to pairs of subsets is designed. This crossover operator permits the use of a novel mutation operator that disrupts loci showing evidence of convergence, yielding better preservation of diversity in the evolving population of diagnostic character positions. A lexical (tie breaking) fitness function is used to smooth the fitness landscape. The problem of locating diagnostic positions in DNA sequences proved difficult without lexical fitness; with that innovation in place the problem is quite tractable. The evolutionary algorithm developed has the potential for broad application such as in conservation, customs enforcement, and forensics.
         
        
            Keywords : 
DNA; biology computing; genomics; DNA sequence; cryptic skipper butterfly species complex; cytochrome c oxidase subunit I; diagnostic character location; evolutionary algorithm; lexical fitness function; mutation operator; Additives; DNA; Documentation; Evolutionary computation; Forensics; Hamming distance; Insects; Life estimation; Organisms; Sequences;
         
        
        
        
            Conference_Titel : 
Computational Intelligence in Bioinformatics and Computational Biology, 2009. CIBCB '09. IEEE Symposium on
         
        
            Conference_Location : 
Nashville, TN
         
        
            Print_ISBN : 
978-1-4244-2756-7
         
        
        
            DOI : 
10.1109/CIBCB.2009.4925713