Title :
Using signal processing techniques for DNA sequence comparison
Author :
Cheever, E.A. ; Searls, D.B. ; Karunaratne, W. ; Overton, G.C.
Author_Institution :
Dept. of Eng., Swarthmore Coll., PA, USA
Abstract :
The most widely used algorithm for the comparison of two sequences of DNA are O(m×n) on the lengths, m and n, of the sequences been compared. The authors present a comparison algorithm that is O(nlog n) on the length, n, of the longer sequence. This algorithm uses techniques developed for rapid comparison of two discrete signals, in particular, cross-correlation using the fast Fourier transform (FFT). The authors treat the DNA as a discrete signal with each nucleotide base represented by a single point in the signal. There are only four possible values that the signal can assume which they represent by one of four complex numbers. The comparison is made by performing a cross correlation between one signal and the complex conjugate of the other. Any significant peak in the resulting signal indicates a strong similarity between the two sequences. The authors present the results of comparison of two strains of the human immunodeficiency virus and of human and simian immunodeficiency viruses. Their results suggest that this technique is a powerful method for comparing very long sequences of DNA
Keywords :
DNA; molecular biophysics; signal processing; DNA sequence comparison; comparison algorithm; complex numbers; cross correlation; fast Fourier transform; human immunodeficiency virus; nucleotide base; signal processing techniques; simian immunodeficiency virus; Bioinformatics; DNA; Educational institutions; Fast Fourier transforms; Fourier transforms; Genomics; Humans; Sequences; Signal processing; Signal processing algorithms;
Conference_Titel :
Bioengineering Conference, 1989., Proceedings of the 1989 Fifteenth Annual Northeast
Conference_Location :
Boston, MA
DOI :
10.1109/NEBC.1989.36756