DocumentCode :
86527
Title :
An Improved Approximation Algorithm for Scaffold Filling to Maximize the Common Adjacencies
Author :
Nan Liu ; Haitao Jiang ; Daming Zhu ; Binhai Zhu
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
Volume :
10
Issue :
4
fYear :
2013
fDate :
July-Aug. 2013
Firstpage :
905
Lastpage :
913
Abstract :
Scaffold filling is a new combinatorial optimization problem in genome sequencing. The one-sided scaffold filling problem can be described as given an incomplete genome I and a complete (reference) genome G, fill the missing genes into I such that the number of common (string) adjacencies between the resulting genome I´ and G is maximized. This problem is NP-complete for genome with duplicated genes and the best known approximation factor is 1.33, which uses a greedy strategy. In this paper, we prove a better lower bound of the optimal solution, and devise a new algorithm by exploiting the maximum matching method and a local improvement technique, which improves the approximation factor to 1.25. For genome with gene repetitions, this is the only known NP-complete problem which admits an approximation with a small constant factor (less than 1.5).
Keywords :
genetics; genomics; optimisation; NP-complete problem; common adjacencies; genome sequence; greedy strategy; maximum matching method; one-sided scaffold filling problem; optimization problem; Algorithm design and analysis; Approximation algorithms; Approximation methods; Bioinformatics; Educational institutions; Genomics; Sequential analysis; Comparative genomics; NP-completeness; adjacencies; algorithms; breakpoints; scaffold filling;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2013.100
Filename :
6582406
Link To Document :
بازگشت