Title :
A symbolic approach to gene expression time series analysis
Author :
Costa, Ivan G. ; De Carvalho, Francisco De A T ; De Souto, Marcílio C P
Author_Institution :
Centre for Informatics, Fed. Univ. of Pernambuco, Brazil
Abstract :
In the analysis of gene expression time series, emphasis has been given on the capture of shape similarity (or dissimilarity). A number of proximity functions have been proposed for this task. However, none of them will suitably measure shape similarity (or dissimilarity) with data containing multiple gene expression time series, unless special data handling is made. In this paper, a symbolic description of multiple gene expression time series, where each variable takes as a value a time series, in conjunction with a version of a proximity measure, is proposed. In this symbolic approach, the shape similarity of each time series is calculated independently, and aggregated at the end. Gene expression data from five distinct time series are presented to a symbolic dynamical clustering method and self-organising map algorithm. The quality of the results obtained is evaluated using gene annotation allowing a verification of this proposal´s adequacy.
Keywords :
genetics; pattern clustering; self-organising feature maps; symbol manipulation; time series; gene expression; proximity functions; proximity measure; self-organising map; shape similarity; symbolic description; symbolic dynamical clustering; time series; Biological processes; Cells (biology); Clustering algorithms; Data handling; Euclidean distance; Gene expression; Informatics; Shape measurement; Time measurement; Time series analysis;
Conference_Titel :
Neural Networks, 2002. SBRN 2002. Proceedings. VII Brazilian Symposium on
Print_ISBN :
0-7695-1709-9
DOI :
10.1109/SBRN.2002.1181430