• DocumentCode
    3154696
  • Title

    Promoter prediction using DNA numerical representation and neural network: Case study with three organisms

  • Author

    Arniker, Swarna Bai ; Kwan, Hon Keung ; Law, Ngai-Fong ; Lun, Daniel Pak-Kong

  • Author_Institution
    Directorate of Laser Syst., Res. Centre Imarat, Hyderabad, India
  • fYear
    2011
  • fDate
    16-18 Dec. 2011
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Promoter recognition in various organisms is an area of interest in bioinformatics. In this paper, a feed-forward neural network classifier is presented to predict promoters in three organisms using a DNA numerical representation approach. The proposed system was found to be able to predict promoters with a sensitivity of 87%, 87%, 99% while reducing false prediction rate for non-promoter sequences with a specificity of 92%, 94%, 99% for the human, Drosophila melanogaster, and Arabidopsis thaliana sequences respectively. The results show that feed-forward neural networks can extract the statistical characteristics of promoters efficiently, and that the 2-bit binary coding for DNA data is suitable for the Berkeley Human and Drosophila datasets and the 4-bit binary is suitable for the TAIR Arabidopsis thaliana data sets. Another result demonstrated here is that the proposed prediction system is reconfigurable and versatile with a reduced architecture and computational complexity.
  • Keywords
    DNA; bioinformatics; feedforward neural nets; statistical analysis; DNA numerical representation; TAIR Arabidopsis thaliana data sets; computational complexity; feed-forward neural network classifier; neural network; promoter prediction; statistical characteristics; Biological neural networks; DNA; Encoding; Genomics; Humans; Sensitivity; Arabidopsis thaliana; DNA numerical representation; Drosophila melanogaster; bioinformatics; neural networks; promoter recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2011 Annual IEEE
  • Conference_Location
    Hyderabad
  • Print_ISBN
    978-1-4577-1110-7
  • Type

    conf

  • DOI
    10.1109/INDCON.2011.6139397
  • Filename
    6139397