• DocumentCode
    3542681
  • Title

    Beyond seed match: Improving miRNA target prediction using PAR-CLIP data

  • Author

    Lu, Mingzhu ; Chen, C. L Philip ; Huang, Yufei

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Texas at San Antonio, San Antonio, TX, USA
  • fYear
    2011
  • fDate
    4-6 Dec. 2011
  • Firstpage
    127
  • Lastpage
    130
  • Abstract
    Since miRNA plays an important role in post-transcript regulation, many computational approaches have been proposed for miRNA target prediction. Yet, the existing algorithms lack the capability to predict the true target when the perfect seed match presents in mRNA sequences and methods based on seed-match still suffer from a high false positive rate. Therefore, this paper proposes a new prediction method that exploits the data produced by the PAR-CLIP, which is a recent high throughput, high precision technology for genome-wide miRNA targets. This algorithm searches true miRNA targets among the candidates with seed-matches by using machine learning approaches. The target prediction results on top 20 expressed miRNAs in HEK293 cells of AGO1-4 proteins PAR-CLIP data show that given presence of seed pairing, the proposed method greatly outperforms the traditional miRNA target prediction algorithms and improve the precision significantly. Because biologists usually need to mutate the seed region to validation the miRNA targets, and only capable of conducting biological experiments on limited miRNA and mRNA sequences due to the time and cost, the proposed approach will make significant impact on the biology and healthcare fields.
  • Keywords
    RNA; biology computing; data handling; genomics; learning (artificial intelligence); AGO1-4 proteins; HEK293 cells; PAR-CLIP data; mRNA sequences; machine learning; miRNA sequences; miRNA target prediction algorithms; post-transcript regulation; seed match; seed region; single-stranded ~22 nucleotides noncoding RNA; Bioinformatics; Context; Feature extraction; Genomics; Prediction algorithms; Training data; Gaussian Process; MicroRNA target prediction; PAR-CLIP; machine learning; seed matches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on
  • Conference_Location
    San Antonio, TX
  • ISSN
    2150-3001
  • Print_ISBN
    978-1-4673-0491-7
  • Electronic_ISBN
    2150-3001
  • Type

    conf

  • DOI
    10.1109/GENSiPS.2011.6169461
  • Filename
    6169461