DocumentCode :
3207673
Title :
A symbolic approach to gene expression time series analysis
Author :
Costa, Ivan G. ; De Carvalho, Francisco De A T ; De Souto, Marcílio C P
Author_Institution :
Centre for Informatics, Fed. Univ. of Pernambuco, Brazil
fYear :
2002
fDate :
2002
Firstpage :
25
Lastpage :
30
Abstract :
In the analysis of gene expression time series, emphasis has been given on the capture of shape similarity (or dissimilarity). A number of proximity functions have been proposed for this task. However, none of them will suitably measure shape similarity (or dissimilarity) with data containing multiple gene expression time series, unless special data handling is made. In this paper, a symbolic description of multiple gene expression time series, where each variable takes as a value a time series, in conjunction with a version of a proximity measure, is proposed. In this symbolic approach, the shape similarity of each time series is calculated independently, and aggregated at the end. Gene expression data from five distinct time series are presented to a symbolic dynamical clustering method and self-organising map algorithm. The quality of the results obtained is evaluated using gene annotation allowing a verification of this proposal´s adequacy.
Keywords :
genetics; pattern clustering; self-organising feature maps; symbol manipulation; time series; gene expression; proximity functions; proximity measure; self-organising map; shape similarity; symbolic description; symbolic dynamical clustering; time series; Biological processes; Cells (biology); Clustering algorithms; Data handling; Euclidean distance; Gene expression; Informatics; Shape measurement; Time measurement; Time series analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2002. SBRN 2002. Proceedings. VII Brazilian Symposium on
Print_ISBN :
0-7695-1709-9
Type :
conf
DOI :
10.1109/SBRN.2002.1181430
Filename :
1181430
Link To Document :
بازگشت