مرکز منطقه ای اطلاع رساني علوم و فناوري - An algorithm for string edit distance allowing substring reversals

DocumentCode :

2771256

Title :

An algorithm for string edit distance allowing substring reversals

Author :

Arslan, Abdullah N.

Author_Institution :

Dept. of Comput. Sci., Vermont Univ., Burlington, VT

fYear :

2006

fDate :

16-18 Oct. 2006

Firstpage :

220

Lastpage :

226

Abstract :

The edit distance between given two strings X and Y is the minimum number of edit operations that transform X into Y. Ordinarily, string editing is based on character insert, delete, and substitute operations. It has been suggested that extending this model with block (substring) edits would be useful in applications such as DNA sequence comparison. In its general form, the resulting problem is NP-hard. However, there are efficient algorithms when string edits include only character, and block replacements. We introduce a new edit model which permits insertions, deletions, and substitutions at character level, and also reversals of substrings. We present an algorithm whose worst-case time complexity is O(n²m) where n=|X|lesm=|Y|, and we prove that the average running time of the algorithm is O(nm). Our experiments on randomly generated strings verify these results. The main contribution of this paper is that we present an algorithm to find all possible reversals using a generalized suffix tree, which is fast on average

Keywords :

DNA; biology computing; computational complexity; molecular biophysics; string matching; trees (mathematics); DNA sequence comparison; block replacements; block reversal; character delete; character insert; character substitute operations; generalized suffix tree; randomly generated strings; string edit distance; substring reversals; Bioinformatics; Biological system modeling; Biology computing; Computational modeling; Computer science; DNA; Genetic mutations; Genomics; Sequences; Sorting;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on

Conference_Location :

Arlington, VA

Print_ISBN :

0-7695-2727-2

Type :

conf

DOI :

10.1109/BIBE.2006.253338

Filename :

4019663

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2771256