Title :
A novel information contents based similarity metric for comparing TFBS motifs
Author :
Zhang, Shaoqiang ; Jiang, Lifen ; Du, Chuanbin ; Su, Zhengchang
Author_Institution :
Coll. of Comput. & Inf. Eng., Tianjin Normal Univ., Tianjin, China
Abstract :
Identifying binding sites recognized by transcription factors (TFs) is one of major challenges to decipher complex genetic regulatory networks encoded in a genome. A set of binding sites recognized by the same TF, called a motif, can be accurately represented by a position frequency matrix (PFM) or a position-specific scoring matrix (PSSM). Very often, we need to compare motifs when searching for similar motifs in a motif database for a query motif, or clustering motifs possibly recognized by the same TF. In this paper, we have designed a novel metric, called SPIC (Similarity between Positions with Information Contents), for quantifying the similarity between two motifs using their PFMs, PSSMs, and column information contents, and demonstrated that this metric outperforms the other state-of-the-art methods for clustering motifs of the same TF and differentiating motifs of different TFs.
Keywords :
genetics; genomics; SPIC metric; Similarity between Positions with Information Contents; TFBS motif; binding site identification; clustering motif; genetic regulatory network; genome; information contents based similarity metric; motif database; position frequency matrix; position specific scoring matrix; query motif; transcription factor; Bioinformatics; Clustering algorithms; Conferences; Databases; Genomics; Prediction algorithms; information contents; motifs, regulatory networks; similarity metric; transcription factor binding sites (TFBS);
Conference_Titel :
Systems Biology (ISB), 2012 IEEE 6th International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4673-4396-1
Electronic_ISBN :
978-1-4673-4397-8
DOI :
10.1109/ISB.2012.6314109