Title :
Representation of DNA sequences with multiple resolutions and BP neural network based classification
Author :
Huang, Xin ; Huang, De-Shuang ; Wang, Hong-Qiang ; Zhao, Xing-Ming
Author_Institution :
Inst. of Intelligent Machines, Chinese Acad. of Sci., Hefei, China
Abstract :
In this paper, we propose a new representation of DNA sequences, which constructs the word frequency vector with multiple resolutions based on the chaos game representation. Compared with the traditional vector, it combines a range of resolutions and reserves higher resolutions, but the dimension is reduced greatly relatively. The algorithm is detailed, which calculates coding format and codes each sequence. To evaluate the significance of our method, we represent Alu sequences by our proposed coding format. After that, the acquired vectors are used to train BP neural networks to recognize the Alu sequences. The experimental results show that this representation of DNA sequences is significant and efficient in biological data processing.
Keywords :
DNA; backpropagation; biology computing; chaos; neural nets; pattern classification; sequences; sequential codes; vector quantisation; Alu sequences; BP neural networks; DNA sequence representation; biological data processing; chaos game representation; coding format; multiple resolutions; pattern classification; sequential codes; training algorithm; word frequency vector; Biological information theory; Chaos; DNA; Frequency; Genomics; Intelligent networks; Machine intelligence; Neural networks; Sequences; Statistical analysis;
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
Print_ISBN :
0-7803-8359-1
DOI :
10.1109/IJCNN.2004.1380108