• DocumentCode
    3378032
  • Title

    Identification of transcription factor binding sites based on the Chi-Square (χ²) distance of a probabilistic vector model

  • Author

    Huang, Lun ; Al Bataineh, Mohammad ; Atkin, G.E. ; Mohammed, Ismaeel ; Zhang, Wei ; Parra, Maria ; Del Mar Perez, Maria

  • Author_Institution
    ECE Dept., Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2009
  • fDate
    13-14 Dec. 2009
  • Firstpage
    73
  • Lastpage
    76
  • Abstract
    This paper describes a new approach for locating signals, such as promoter sequences, in nucleic acid sequences. Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is a position weight matrix (PWM), which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. In this paper, we present a Chi-square (χ²) distance model, which is based on the distance between the profiles of component vectors. It is a novel probabilistic method for modeling TF-DNA interactions. Our approach uses χ² distances to represent TF binding specificities. Simulation results show that the proposed approach identifies TF binding sites significantly better than the PWM model method.
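    The abstract does not give the authors' formulation, so the following is only a minimal sketch of the generic idea: scoring candidate windows by the textbook χ² distance, sum((p - q)² / (p + q)), between a one-hot encoding of each base and a per-position base-frequency profile. The function names, the pseudocount, the one-hot encoding, and the per-position summation are all assumptions made for illustration and are not taken from the paper; in particular, summing over positions is a simplification and does not reproduce whatever "component vectors" the authors use.

```python
from collections import Counter

BASES = "ACGT"


def column_profiles(sites, pseudocount=1.0):
    """Per-position base frequencies from aligned, equal-length sites (hypothetical helper)."""
    total = len(sites) + pseudocount * len(BASES)
    profiles = []
    for i in range(len(sites[0])):
        counts = Counter(site[i] for site in sites)
        profiles.append([(counts[b] + pseudocount) / total for b in BASES])
    return profiles


def one_hot(base):
    """Encode a single base as a 4-component indicator vector."""
    return [1.0 if b == base else 0.0 for b in BASES]


def chi_square_distance(p, q):
    """Textbook chi-square distance between two non-negative vectors."""
    return sum((a - b) ** 2 / (a + b) for a, b in zip(p, q) if a + b > 0)


def window_distance(window, profiles):
    """Sum of per-position chi-square distances (an assumed simplification)."""
    return sum(
        chi_square_distance(one_hot(base), profile)
        for base, profile in zip(window, profiles)
    )


def best_site(sequence, profiles):
    """Return (distance, start) of the window closest to the profile."""
    width = len(profiles)
    return min(
        (window_distance(sequence[i:i + width], profiles), i)
        for i in range(len(sequence) - width + 1)
    )


if __name__ == "__main__":
    # Toy aligned sites, invented for this example only.
    toy_sites = ["TATAAT", "TATAAT", "TATGAT", "TACAAT"]
    toy_profiles = column_profiles(toy_sites)
    print(best_site("GGCCTATAATGCGC", toy_profiles))
```

    A smaller distance means a window is closer to the profile, so candidate sites are ranked by ascending distance, the opposite direction from a PWM log-odds score.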
  • Keywords
    DNA; biology computing; statistical distributions; Chi-Square distance; DNA target site; nucleic acid sequences; position weight matrix; probabilistic vector model; signal location; transcription factor binding; Transcription Factor; promoter
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    International Conference on Future BioMedical Information Engineering, 2009 (FBIE 2009)
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4244-4690-2
  • Electronic_ISBN
    978-1-4244-4692-6
  • Type

    conf

  • DOI
    10.1109/FBIE.2009.5405793
  • Filename
    5405793