DocumentCode
423683
Title
Representation of DNA sequences with multiple resolutions and BP neural network based classification
Author
Huang, Xin ; Huang, De-Shuang ; Wang, Hong-Qiang ; Zhao, Xing-Ming
Author_Institution
Inst. of Intelligent Machines, Chinese Acad. of Sci., Hefei, China
Volume
2
fYear
2004
fDate
25-29 July 2004
Firstpage
1185
Abstract
In this paper, we propose a new representation of DNA sequences, which constructs the word frequency vector with multiple resolutions based on the chaos game representation. Compared with the traditional vector, it combines a range of resolutions and reserves higher resolutions, but the dimension is reduced greatly relatively. The algorithm is detailed, which calculates coding format and codes each sequence. To evaluate the significance of our method, we represent Alu sequences by our proposed coding format. After that, the acquired vectors are used to train BP neural networks to recognize the Alu sequences. The experimental results show that this representation of DNA sequences is significant and efficient in biological data processing.
Keywords
DNA; backpropagation; biology computing; chaos; neural nets; pattern classification; sequences; sequential codes; vector quantisation; Alu sequences; BP neural networks; DNA sequence representation; biological data processing; chaos game representation; coding format; multiple resolutions; pattern classification; sequential codes; training algorithm; word frequency vector; Biological information theory; Chaos; DNA; Frequency; Genomics; Intelligent networks; Machine intelligence; Neural networks; Sequences; Statistical analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1380108
Filename
1380108
Link To Document