DocumentCode
3378032
Title
Identification of transcription factor binding sites based on the Chi-Square (χ²) distance of a probabilistic vector model
Author
Huang, Lun ; Al Bataineh, Mohammad ; Atkin, G.E. ; Mohammed, Ismaeel ; Zhang, Wei ; Parra, Maria ; Del Mar Perez, Maria
Author_Institution
ECE Dept., Illinois Inst. of Technol., Chicago, IL, USA
fYear
2009
fDate
13-14 Dec. 2009
Firstpage
73
Lastpage
76
Abstract
This paper describes a new approach for locating signals, such as promoter sequences, in nucleic acid sequences. Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is the position weight matrix (PWM), which assumes independence between binding positions. In many cases, however, this simplifying assumption does not hold. In this paper, we present a Chi-square (χ²) distance model, a novel probabilistic method for modeling TF-DNA interactions that is based on the distance between the profiles of component vectors. Our approach uses χ² distances to represent TF binding specificities. Simulation results show that the proposed approach identifies TF binding sites significantly better than the PWM method.
Keywords
DNA; biology computing; statistical distributions; Chi-Square distance; DNA target site; nucleic acid sequences; position weight matrix; probabilistic vector model; signal location; transcription factor binding; Transcription Factor; promoter
fLanguage
English
Publisher
IEEE
Conference_Titel
BioMedical Information Engineering, 2009. FBIE 2009. International Conference on Future
Conference_Location
Sanya
Print_ISBN
978-1-4244-4690-2
Electronic_ISBN
978-1-4244-4692-6
Type
conf
DOI
10.1109/FBIE.2009.5405793
Filename
5405793