DocumentCode :
3097127
Title :
Intron Identification Approaches Based on Weighted Features and Fuzzy Decision Trees
Author :
Huang, Yin-Fu ; Liang, Ching Ping ; Liou, Sing-Wu
Author_Institution :
Grad. Sch. of Comput. Sci. & Inf. Eng., Nat. Yunlin Univ. of Sci. & Technol. Touliu, Touliu, Taiwan
fYear :
2010
fDate :
18-20 June 2010
Firstpage :
1
Lastpage :
4
Abstract :
Current computational predictions of splice sites largely depend on the sequence patterns of known intronic sequence features (ISFs) described in the classical intron definition model (IDM). The computation-oriented IDM (CO-IDM) clearly provides more specific and concrete information for describing intron flanks of splice sites (IFSSs). In the paper, we proposed a novel approach of fuzzy decision trees (FDTs) which utilize 1) weighted ISFs of twelve uni-frame patterns (UFPs) and forty-five multi-frame patterns (MFPs) and 2) gain ratios to improve the performances in identifying an intron. First, we fuzzified extracted features from genomic sequences using membership functions with an unsupervised self-organizing map (SOM) technique. Then, we brought in different viewpoints of globally weighting and crossly referring in generating fuzzy rules which are interpretable and useful for biologists to verify whether a sequence is an intron or not. Finally, the experimental results revealed the effectiveness of the proposed method in improving the identification accuracy. Besides, we also implemented an on-line intronic identifier to infer an unknown genomic sequence.
Keywords :
biology computing; decision trees; feature extraction; fuzzy set theory; genomics; macromolecules; pattern classification; self-organising feature maps; computational prediction; features extraction; fuzzy decision tree; fuzzy rules generation; genomic sequence; intron identification approach; intronic sequence feature; membership function; multiframe pattern; splice site; uniframe pattern; unsupervised self organizing map technique; weighted feature; Bioinformatics; Biology computing; Computational modeling; Computer science; Concrete; Decision trees; Diversity reception; Genomics; Predictive models; Pulse width modulation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
ISSN :
2151-7614
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
Type :
conf
DOI :
10.1109/ICBBE.2010.5515314
Filename :
5515314
Link To Document :
بازگشت