Title :
ListMotif: A time and memory efficient algorithm for weak motif discovery
Author :
Sun, He Quan ; Low, Malcolm Yoke Hean ; Hsu, Wen Jing ; Rajapakse, Jagath C.
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Weak motif discovery is a fundamental problem in computational biology. It is difficult because the number of mismatches between the true motif and its mutated instances is large enough that spurious signals can mask the true ones. Many algorithms have been proposed for this problem, but they either require a large amount of memory or take too much time. This paper proposes ListMotif, a sample-driven algorithm that uses substrings from the data to construct lists of motif instances. ListMotif is memory efficient, and it is also time efficient because it avoids recalculating the Hamming distances between substrings. Experiments on real biological data demonstrate its applicability in practice, and test results on synthetic data show that ListMotif can discover longer and weaker motifs than some previously proposed algorithms.
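The abstract does not give the details of ListMotif, so the following is only a rough sketch of the general sample-driven idea it describes: substrings taken from the data serve as references, a list of candidate instances is built for each one, and pairwise Hamming distances are cached rather than recomputed. The function names (`hamming`, `lmers`, `instance_lists`), the use of the first sequence as the source of references, and the `max_dist` threshold are all illustrative assumptions, not the authors' method.

```python
def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def lmers(seq: str, l: int) -> list[str]:
    """All length-l substrings of seq, in order of position."""
    return [seq[i:i + l] for i in range(len(seq) - l + 1)]


def instance_lists(seqs: list[str], l: int, max_dist: int) -> dict[str, list[list[str]]]:
    """Hypothetical helper: for each l-mer of the first sequence, list the
    l-mers of every other sequence that lie within max_dist of it.

    A reference is kept only if every other sequence contributes at least
    one candidate instance.
    """
    cache: dict[tuple[str, str], int] = {}

    def dist(a: str, b: str) -> int:
        # Distances are symmetric, so store each unordered pair once.
        key = (a, b) if a <= b else (b, a)
        if key not in cache:
            cache[key] = hamming(a, b)
        return cache[key]

    result: dict[str, list[list[str]]] = {}
    for ref in lmers(seqs[0], l):
        per_seq = [
            [s for s in lmers(other, l) if dist(ref, s) <= max_dist]
            for other in seqs[1:]
        ]
        if all(per_seq):  # every other sequence has at least one candidate
            result[ref] = per_seq
    return result


if __name__ == "__main__":
    toy = ["ACGTAC", "TTACGA", "GGACCT"]
    for ref, lists in instance_lists(toy, l=4, max_dist=1).items():
        print(ref, lists)
```

A dictionary keyed on the unordered substring pair is one simple way to avoid repeated distance computations; the paper's keywords mention table lookup, but whether ListMotif uses a comparable structure is not stated in the abstract.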
Keywords :
biology computing; ListMotif; computational biology; memory efficient algorithm; sample-driven algorithm; time efficient algorithm; weak motif discovery; Algorithm design and analysis; Complexity theory; DNA; Hamming distance; Memory management; Table lookup;
Conference_Title :
Intelligent Systems and Knowledge Engineering (ISKE), 2010 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-6791-4
DOI :
10.1109/ISKE.2010.5680875