DocumentCode :
445831
Title :
Effect of non-target examples on E.coli promoters recognition using neural networks
Author :
Conilione, Paul C. ; Wang, Dianhui
Author_Institution :
Dept. of Comput. Sci. & Comput. Eng., La Trobe Univ., Melbourne, Vic., Australia
Volume :
1
fYear :
2005
fDate :
31 July-4 Aug. 2005
Firstpage :
310
Abstract :
Previous research into the recognition of E.coli promoters has focused on the use of raw DNA sequences and alignment methods to find interesting features in the promoter regions. In this paper, we aim to compare the classification accuracy of a neural network trained on DNA sequences encoded using orthogonal representation of the nucleotides, and a set of high level features from the DNA. In addition to this, we evaluate the impact of different types of non-promoters used in training and testing on the classification accuracy. 872 E.coli promoters were used and three types of non-promoters, which included random sequences with the same base frequency as the promoter sequences, genes sequences selected from E.coli and random sequences with the same base frequencies as the gene non-promoters. Raw DNA sequences were encoded using CODE-4 and high level features, which were outlined by previous researchers and subsequently formally defined in this paper. We found that the high level features did not perform as well for promoter recognition compared with CODE-4 DNA representation, contrary to expectation. The strongest determining factor in classification accuracy was the type of non-promoter used for training and testing. Overall non-promoters from coding regions and random sequences with the same base frequency as the gene non-promoter resulted in the best classification accuracy.
Keywords :
biology computing; neural nets; DNA sequences; E.coli promoters recognition; alignment method; neural networks; nontarget examples; orthogonal nucleotide representation; Biochemistry; DNA; Frequency; Neural networks; Pattern recognition; Polymers; RNA; Random sequences; Statistical analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
Type :
conf
DOI :
10.1109/IJCNN.2005.1555848
Filename :
1555848
Link To Document :
بازگشت