DocumentCode :
3030967
Title :
Fuzzy Classification of Genome Sequences Prior to Assembly Based on Similarity Measures
Author :
Nasser, Sara ; Vert, Gregory L. ; Breland, Adrienne ; Nicolescu, Monica
Author_Institution :
Univ. of Nevada, Reno
fYear :
2007
fDate :
24-27 June 2007
Firstpage :
354
Lastpage :
359
Abstract :
Nucleotide sequencing of genomic data is an important step towards building understanding of gene expression. Current limitations in sequencing limit the number of base pairs that can be processed to only several hundred at a time. Consequently, these sequenced substrings need to be assembled into the overall genome. However, the existence of insertions, deletions and substitutions can complicate the assembly of subsequences and confuse existing methods. What has been needed is an approach that deals with ambiguity in trying to match and assemble a genome from its sequenced subsequences. This research develops fuzzy similarity measures between subsequences that are then incorporated into an assembler based on fuzzy logic and fuzzy similarity measures. The research addresses the problem of extensive computation required by clustering data into meaningful groups. Preliminary evaluation of this approach in conjunction with K-Means clustering suggests that this approach is at least as good as standard approaches and in some cases better.
Keywords :
DNA; biology computing; fuzzy logic; fuzzy set theory; genetics; pattern classification; pattern clustering; sequences; DNA; fuzzy classification; fuzzy logic; fuzzy similarity measure; genome sequence; nucleotide sequencing; pattern clustering; Assembly; Bioinformatics; Computer science; DNA; Data engineering; Fuzzy logic; Gene expression; Genomics; Nuclear measurements; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Information Processing Society, 2007. NAFIPS '07. Annual Meeting of the North American
Conference_Location :
San Diego, CA
Print_ISBN :
1-4244-1213-7
Electronic_ISBN :
1-4244-1214-5
Type :
conf
DOI :
10.1109/NAFIPS.2007.383864
Filename :
4271087
Link To Document :
بازگشت