DocumentCode :
1418802
Title :
A Biologically Inspired Validity Measure for Comparison of Clustering Methods over Metabolic Data Sets
Author :
Stegmayer, Georgina ; Milone, Diego H. ; Kamenetzky, Laura ; López, Mariana G. ; Carrari, Fernando
Author_Institution :
Center for R&D of Inf. Syst. (CIDISI, UTN-FRSF, Santa Fe, Argentina
Volume :
9
Issue :
3
fYear :
2012
Firstpage :
706
Lastpage :
716
Abstract :
In the biological domain, clustering is based on the assumption that genes or metabolites involved in a common biological process are coexpressed/coaccumulated under the control of the same regulatory network. Thus, a detailed inspection of the grouped patterns to verify their memberships to well-known metabolic pathways could be very useful for the evaluation of clusters from a biological perspective. The aim of this work is to propose a novel approach for the comparison of clustering methods over metabolic data sets, including prior biological knowledge about the relation among elements that constitute the clusters. A way of measuring the biological significance of clustering solutions is proposed. This is addressed from the perspective of the usefulness of the clusters to identify those patterns that change in coordination and belong to common pathways of metabolic regulation. The measure summarizes in a compact way the objective analysis of clustering methods, which respects coherence and clusters distribution. It also evaluates the biological internal connections of such clusters considering common pathways. The proposed measure was tested in two biological databases using three clustering methods.
Keywords :
biochemistry; biology computing; genetics; molecular biophysics; statistical analysis; biological internal connections; biologically inspired validity; clustering methods; clusters distribution; coherence; genes; metabolic data sets; metabolic regulation; metabolites; regulatory network; Bioinformatics; Biological processes; Clustering algorithms; Clustering methods; Coherence; Couplings; Clustering; biological assessment; metabolic pathways.; validation measure; Algorithms; Cluster Analysis; Databases, Factual; Gene Regulatory Networks; Metabolic Networks and Pathways; Reproducibility of Results;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2012.10
Filename :
6127857
Link To Document :
بازگشت