DocumentCode :
3207681
Title :
The influence of noisy patterns on the performance of learning methods in the splice junction recognition problem
Author :
Lorena, Ana C. ; Batista, Gustavo E A P A ; De Carvalho, André C P L F ; Monard, Maria C.
Author_Institution :
Inst. de Ciencias Matematicas a de Computacao, Univ. de Sao Paulo, Sao Carlos, Brazil
fYear :
2002
fDate :
2002
Firstpage :
31
Lastpage :
36
Abstract :
Since the beginning of the Human Genome Project, which aims at sequencing all the human´s genetic information, a large amount of sequence data has been generated. Much attention is now given to the analysis of this data. A great part of these analysis is carried out with the use of intelligent computational techniques. However, many of the genetic databases are characterized by the presence of noisy data, which can deteriorate the performance of the computational techniques applied. This work studies the influence of noisy data in the training of three different learning methods: decision trees, artificial neural networks and support vector machines. The task investigated is the recognition of splice junctions in DNA sequences, which is part of the gene identification problem. Results indicate that the elimination of noisy patterns from the dataset can improve the learning algorithms´ performance, with no significant reduction in their generalization ability.
Keywords :
DNA; biology computing; decision trees; learning (artificial intelligence); neural nets; pattern classification; Human Genome Project; decision trees; genetic databases; human genetic information; learning algorithms; neural networks; noisy data; splice junction recognition; support vector machines; Bioinformatics; Computational and artificial intelligence; Data analysis; Databases; Genetics; Genomics; Humans; Learning systems; Pattern recognition; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2002. SBRN 2002. Proceedings. VII Brazilian Symposium on
Print_ISBN :
0-7695-1709-9
Type :
conf
DOI :
10.1109/SBRN.2002.1181431
Filename :
1181431
Link To Document :
بازگشت