• DocumentCode
    3394061
  • Title

    Diagnostic character location within the cryptic skipper butterfly species complex with an evolutionary algorithm

  • Author

    Ashlock, Daniel ; Von Königslöw, Taika

  • Author_Institution
    Dept. of Math. & Stat., Univ. of Guelph, Guelph, ON
  • fYear
    2009
  • fDate
    March 30 2009-April 2 2009
  • Firstpage
    92
  • Lastpage
    99
  • Abstract
    This study presents an evolutionary algorithm for locating DNA sequence characters that are diagnostic between closely related groups of species. The algorithm is developed using synthetic data and then tested on biological data from a species of butterfly recently discovered to be a cryptic complex of species. This technique proved to be successful in locating positions that are diagnostic of the cryptic neotropical skipper butterfly species within the cytochrome c oxidase subunit I (COI) DNA barcode data. The algorithm uses a novel subset representation to select positions within the DNA sequences. A crossover operator that takes pairs of subsets to pairs of subsets is designed. This crossover operator permits the use of a novel mutation operator that disrupts loci showing evidence of convergence, yielding better preservation of diversity in the evolving population of diagnostic character positions. A lexical (tie breaking) fitness function is used to smooth the fitness landscape. The problem of locating diagnostic positions in DNA sequences proved difficult without lexical fitness; with that innovation in place the problem is quite tractable. The evolutionary algorithm developed has the potential for broad application such as in conservation, customs enforcement, and forensics.
  • Keywords
    DNA; biology computing; genomics; DNA sequence; cryptic skipper butterfly species complex; cytochrome c oxidase subunit I; diagnostic character location; evolutionary algorithm; lexical fitness function; mutation operator; Additives; DNA; Documentation; Evolutionary computation; Forensics; Hamming distance; Insects; Life estimation; Organisms; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology, 2009. CIBCB '09. IEEE Symposium on
  • Conference_Location
    Nashville, TN
  • Print_ISBN
    978-1-4244-2756-7
  • Type

    conf

  • DOI
    10.1109/CIBCB.2009.4925713
  • Filename
    4925713