Title :
Data visualization methodologies for data mining systems in bioinformatics
Author :
Staiano, A. ; Ciaramella, A. ; Raiconi, G. ; Tagliaferri, R. ; Amato, R. ; Longo, G. ; Miele, G. ; Donalek, C.
Author_Institution :
Dept. of Math. & Informatics, Salerno Univ., Fisciano, Italy
fDate :
31 July-4 Aug. 2005
Abstract :
Bioinformatics systems benefit from data mining strategies that locate interesting and pertinent relationships within massive amounts of information. For example, data mining methods can identify and summarize the set of genes responding to a certain level of stress in an organism. Even a cursory glance through the journal literature reveals the persistent role of data mining in experimental biology. Integrating data mining within the context of experimental investigations is central to bioinformatics software. In this paper we describe the framework of probabilistic principal surfaces, a latent variable model that offers a large variety of appealing visualization capabilities and can be successfully integrated in the context of microarray analysis. A preprocessing phase has been added to this framework: a nonlinear PCA neural network, which seems to be very useful for dealing with the noisy and time-dependent nature of microarray data.
Keywords :
biology computing; data mining; data visualisation; neural nets; principal component analysis; bioinformatics software; bioinformatics systems; data mining systems; data visualization; latent variable model; microarray analysis; microarray data; nonlinear PCA neural network; probabilistic principal surfaces; Bioinformatics; Biological system modeling; Context modeling; Data mining; Data visualization; Neural networks; Organisms; Phase noise; Principal component analysis; Stress
Conference_Title :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1555820