• DocumentCode
    3320077
  • Title
    Decoding Trace Peak Behaviour - A Neuro-Fuzzy Approach
  • Author
    Thornley, David ; Petridis, Stavros
  • Author_Institution
    Imperial Coll. London, London
  • fYear
    2007
  • fDate
    23-26 July 2007
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    DNA sequence basecalling is commonly regarded as a solved problem, despite significant error rates being reflected in inaccuracies in databases and genome annotations. This has made measures of confidence of basecalls important, and fuzzy methods have recently been used to approximate confidence by responding to data quality at the calling position. We have demonstrated that variation in contextual sequencing trace data peak heights actively encodes novel information which can be used for basecalling and confidence estimation. Using neuro-fuzzy classifiers, we are able to decode much of the hidden contextual information in two fuzzy rules per base and partially reveal its underlying behaviour. Those two fuzzy rules can satisfactorily explain over 74% of data samples. The error rate is 6-7% higher on individual bases than when using classification trees, but the number of rules is reduced by a factor of 100. Compact, comprehensible knowledge representation is achieved with the use of SANFIS, which allows us to easily interpret the embedded knowledge. Finally, we propose a hybrid architecture based on SANFIS which achieves slightly better performance than a classification tree with significantly improved knowledge representation.
  • Keywords
    DNA; biocomputing; fuzzy neural nets; fuzzy set theory; knowledge representation; DNA sequence; SANFIS; basecalling; confidence estimation; contextual sequencing trace data peak; fuzzy methods; fuzzy rules per base; hidden contextual information; neuro-fuzzy approach; neuro-fuzzy classifiers; Bioinformatics; Classification tree analysis; Databases; Decoding; Error analysis; Genomics; Position measurement; Sequences
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fuzzy Systems Conference, 2007. FUZZ-IEEE 2007. IEEE International
  • Conference_Location
    London
  • ISSN
    1098-7584
  • Print_ISBN
    1-4244-1209-9
  • Electronic_ISBN
    1098-7584
  • Type
    conf
  • DOI
    10.1109/FUZZY.2007.4295658
  • Filename
    4295658