DocumentCode
3207673
Title
A symbolic approach to gene expression time series analysis
Author
Costa, Ivan G. ; De Carvalho, Francisco De A T ; De Souto, Marcílio C P
Author_Institution
Centre for Informatics, Fed. Univ. of Pernambuco, Brazil
fYear
2002
fDate
2002
Firstpage
25
Lastpage
30
Abstract
In the analysis of gene expression time series, emphasis has been given on the capture of shape similarity (or dissimilarity). A number of proximity functions have been proposed for this task. However, none of them will suitably measure shape similarity (or dissimilarity) with data containing multiple gene expression time series, unless special data handling is made. In this paper, a symbolic description of multiple gene expression time series, where each variable takes as a value a time series, in conjunction with a version of a proximity measure, is proposed. In this symbolic approach, the shape similarity of each time series is calculated independently, and aggregated at the end. Gene expression data from five distinct time series are presented to a symbolic dynamical clustering method and self-organising map algorithm. The quality of the results obtained is evaluated using gene annotation allowing a verification of this proposal´s adequacy.
Keywords
genetics; pattern clustering; self-organising feature maps; symbol manipulation; time series; gene expression; proximity functions; proximity measure; self-organising map; shape similarity; symbolic description; symbolic dynamical clustering; time series; Biological processes; Cells (biology); Clustering algorithms; Data handling; Euclidean distance; Gene expression; Informatics; Shape measurement; Time measurement; Time series analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2002. SBRN 2002. Proceedings. VII Brazilian Symposium on
Print_ISBN
0-7695-1709-9
Type
conf
DOI
10.1109/SBRN.2002.1181430
Filename
1181430
Link To Document