Title :
Study of fuzzy resemblance measures for DNA motifs
Author :
Garcia, Fernando ; Lopez, Francisco J. ; Cano, Carlos ; Blanco, Armando
Author_Institution :
Dept. of Comput. Sci. & AI, Univ. of Granada, Granada, Spain
Abstract :
TFBSs are known as regulatory motifs and can be represented as position frequency matrices (PFMs). The de novo identification of transcription factor binding sites (TFBSs) is a crucial problem in computational biology and includes the issue of comparing putative TFBSs to one another and to already known TFBSs. To date there is no fuzzy approach for this problem. In this work we propose the use of fuzzy measures to deal with motif comparison tasks. We investigate the behavior of different classes of classical measures for fuzzy sets including set-theoretic (Jaccard´s method), proximity-based (Minkowsky´s r-metric), angular coefficient-based (Bhattacharyya´s distance) and a measure defined for the fuzzy polynucleotide space. We show that fuzzy measures provide excellent results when dealing with sets of randomly generated motifs and outperforms other existing measures when facing datasets of real motifs.
Keywords :
DNA; bioinformatics; fuzzy set theory; matrix algebra; DNA motifs; bioinformatics; computational biology; de novo identification; fuzzy polynucleotide space; fuzzy resemblance measure; fuzzy set theory; position frequency matrices; regulatory motif; transcription factor binding site; Artificial intelligence; Computational biology; Computer science; DNA; Frequency; Fuzzy set theory; Fuzzy sets; Proteins; Pulse width modulation; Testing;
Conference_Titel :
Fuzzy Systems, 2009. FUZZ-IEEE 2009. IEEE International Conference on
Conference_Location :
Jeju Island
Print_ISBN :
978-1-4244-3596-8
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZY.2009.5277089