• DocumentCode
    23542
  • Title

    Graphical Representation for DNA Sequences via Joint Diagonalization of Matrix Pencil

  • Author

    Hong-Jie Yu ; De-Shuang Huang

  • Author_Institution
    Dept. of Math., Anhui Sci. & Technol. Univ., Fengyang, China
  • Volume
    17
  • Issue
    3
  • fYear
    2013
  • fDate
    May-13
  • Firstpage
    503
  • Lastpage
    511
  • Abstract
    Graphical representations provide us with a tool allowing visual inspection of the sequences. To visualize and compare different DNA sequences, a novel alignment-free method is proposed in this paper for both graphical representation and similarity analysis of sequences. We introduce a transformation to represent each DNA sequence with neighboring nucleotide matrix. Then, based on approximate joint diagonalization theory, we transform each DNA primary sequence into a corresponding eigenvalue vector (EVV), which can be considered as numerical characterization of DNA sequence. Meanwhile, we get graphical representation for DNA sequence via the plot of EVV in 2-D plane. Moreover, using k-means, we cluster these feature curves of sequences into several reasonable subclasses. In addition, similarity analyses are performed by computing the distances among the obtained vectors. This approach contains more sequence information, and it analyzes all the involved sequence information jointly rather than separately. A typical dendrogram constructed by this method demonstrates the effectiveness of our approach.
  • Keywords
    DNA; bioinformatics; eigenvalues and eigenfunctions; graph theory; molecular biophysics; 2-D plane EVV plot; DNA primary sequence transformation; DNA sequence comparison; DNA sequence graphical representation; DNA sequence numerical characterization; DNA sequence visualization; alignment-free method; approximate joint diagonalization theory; dendrogram construction; eigenvalue vector; k-mean; matrix pencil joint diagonalization; neighboring nucleotide matrix; sequence feature curve clustering; sequence information analysis; sequence similarity analysis; sequence visual inspection; vector distance computation; Algorithm design and analysis; Approximation algorithms; DNA; Joints; Sparse matrices; Symmetric matrices; Vectors; Approximate joint diagonalization (AJD); dendrogram; graphical representation; similarity analysis;
  • fLanguage
    English
  • Journal_Title
    Biomedical and Health Informatics, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    2168-2194
  • Type

    jour

  • DOI
    10.1109/TITB.2012.2227146
  • Filename
    6417937