Title :
Graphical Representation for DNA Sequences via Joint Diagonalization of Matrix Pencil
Author :
Hong-Jie Yu ; De-Shuang Huang
Author_Institution :
Dept. of Math., Anhui Sci. & Technol. Univ., Fengyang, China
Abstract :
Graphical representations provide a tool for visual inspection of sequences. To visualize and compare different DNA sequences, this paper proposes a novel alignment-free method for both graphical representation and similarity analysis of sequences. We introduce a transformation that represents each DNA sequence as a neighboring nucleotide matrix. Then, based on approximate joint diagonalization theory, we transform each DNA primary sequence into a corresponding eigenvalue vector (EVV), which can be regarded as a numerical characterization of the DNA sequence. We obtain a graphical representation of each DNA sequence by plotting its EVV in the 2-D plane. Using k-means, we then cluster these feature curves into several reasonable subclasses. In addition, similarity analyses are performed by computing the distances among the obtained vectors. The approach captures more sequence information and analyzes all of the sequence information involved jointly rather than separately. A typical dendrogram constructed by this method demonstrates the effectiveness of our approach.
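The first and last steps described in the abstract (a matrix built from neighboring nucleotides, and distances among per-sequence vectors) can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not define the neighboring nucleotide matrix, so the sketch assumes a 4x4 matrix of adjacent-pair frequencies, and it omits the approximate joint diagonalization step entirely, using the flattened matrix as a stand-in for the EVV. All function names are hypothetical.

```python
from math import sqrt

ALPHABET = "ACGT"
INDEX = {base: i for i, base in enumerate(ALPHABET)}


def neighbor_matrix(seq):
    """Return a 4x4 matrix of relative frequencies of adjacent nucleotide pairs.

    Assumption: this dinucleotide-frequency matrix stands in for the paper's
    neighboring nucleotide matrix, whose exact definition is not in the abstract.
    """
    counts = [[0.0] * 4 for _ in range(4)]
    total = 0
    for a, b in zip(seq, seq[1:]):
        # Skip pairs containing ambiguous symbols such as N.
        if a in INDEX and b in INDEX:
            counts[INDEX[a]][INDEX[b]] += 1
            total += 1
    if total == 0:
        return counts
    return [[c / total for c in row] for row in counts]


def feature_vector(seq):
    """Flatten the matrix row by row into a 16-element vector.

    Assumption: a placeholder for the eigenvalue vector (EVV); no joint
    diagonalization is performed here.
    """
    return [v for row in neighbor_matrix(seq.upper()) for v in row]


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def distance_matrix(seqs):
    """Pairwise distances among the feature vectors of the given sequences."""
    vecs = [feature_vector(s) for s in seqs]
    return [[euclidean(u, v) for v in vecs] for u in vecs]
```

A distance matrix of this kind could be passed to any hierarchical clustering routine to draw a dendrogram. The choice of Euclidean distance is also an assumption, since the abstract does not name the metric.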
Keywords :
DNA; bioinformatics; eigenvalues and eigenfunctions; graph theory; molecular biophysics; 2-D plane EVV plot; DNA primary sequence transformation; DNA sequence comparison; DNA sequence graphical representation; DNA sequence numerical characterization; DNA sequence visualization; alignment-free method; approximate joint diagonalization theory; dendrogram construction; eigenvalue vector; k-means; matrix pencil joint diagonalization; neighboring nucleotide matrix; sequence feature curve clustering; sequence information analysis; sequence similarity analysis; sequence visual inspection; vector distance computation; Algorithm design and analysis; Approximation algorithms; Joints; Sparse matrices; Symmetric matrices; Vectors; Approximate joint diagonalization (AJD); dendrogram; graphical representation; similarity analysis
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
DOI :
10.1109/TITB.2012.2227146