DocumentCode :
423683
Title :
Representation of DNA sequences with multiple resolutions and BP neural network based classification
Author :
Huang, Xin ; Huang, De-Shuang ; Wang, Hong-Qiang ; Zhao, Xing-Ming
Author_Institution :
Inst. of Intelligent Machines, Chinese Acad. of Sci., Hefei, China
Volume :
2
fYear :
2004
fDate :
25-29 July 2004
Firstpage :
1185
Abstract :
In this paper, we propose a new representation of DNA sequences, which constructs the word frequency vector with multiple resolutions based on the chaos game representation. Compared with the traditional vector, it combines a range of resolutions and reserves higher resolutions, but the dimension is reduced greatly relatively. The algorithm is detailed, which calculates coding format and codes each sequence. To evaluate the significance of our method, we represent Alu sequences by our proposed coding format. After that, the acquired vectors are used to train BP neural networks to recognize the Alu sequences. The experimental results show that this representation of DNA sequences is significant and efficient in biological data processing.
Keywords :
DNA; backpropagation; biology computing; chaos; neural nets; pattern classification; sequences; sequential codes; vector quantisation; Alu sequences; BP neural networks; DNA sequence representation; biological data processing; chaos game representation; coding format; multiple resolutions; pattern classification; sequential codes; training algorithm; word frequency vector; Biological information theory; Chaos; DNA; Frequency; Genomics; Intelligent networks; Machine intelligence; Neural networks; Sequences; Statistical analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-8359-1
Type :
conf
DOI :
10.1109/IJCNN.2004.1380108
Filename :
1380108
Link To Document :
بازگشت