Title :
Approximate Common Intervals in Multiple Genome Comparison
Author :
Chateau, Annie ; Riou, Pierre ; Rivals, Eric
Author_Institution :
LIRMM, Univ. Montpellier 2, Montpellier, France
Abstract :
We consider the problem of inferring approximate common intervals of multiple genomes. Genomes are modelled as sequences of homologous genes families identifiers, and approximate common intervals represent conserved regions possibly showing rearrangements, as well as repetitions, or insertions/deletions. This problem is already known, but existing approaches are not incremental and somehow limited to special cases. We adopt a simple, classical graph-based approach, where the vertices of the graph represent the exact common intervals of the sequences (i.e., regions containing the same gene set), and where edges link vertices that differ by less than δ elements (with δ being parameter). With this model, approximate gene clusters are maximal cliques of the graph: computing them can exploit known and well designed algorithms. For a proof of concept, we applied the method to several datasets of bacterial genomes and compared the two maximal cliques algorithms, a static and a dynamic one. While being quite flexible, this approach opens the way to a combinatorial characterization of genomic rearrangements in terms of .
Keywords :
biology computing; data visualisation; genomics; graph theory; molecular biophysics; approximate gene cluster; bacterial genome dataset; dynamic maximal clique algorithm; gene deletion; gene insertion; gene rearrangement; gene repetition; genes family identifier sequence; graph substructure; graph vertex; multiple genome interval; static maximal clique algorithm; Approximation algorithms; Bioinformatics; Clustering algorithms; Genomics; Heuristic algorithms; Microorganisms;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1799-4
DOI :
10.1109/BIBM.2011.96