Title :
A new iterative graph based clustering method
Author :
Pikoulis, E. ; Psarakis, Emmanouil Z.
Author_Institution :
Dept. of Comput. Eng. & Inf., Univ. of Patras, Rio-Patras, Greece
Abstract :
In this paper we propose a new iterative graph based clustering technique. The proposed method has three desirable characteristics as compared to well known clustering techniques. Specifically, because of its iterative nature, the number of clusters contained in a given data set it is not necessary to be known a priori as partitional clustering techniques demand. Its performance, in terms of the quality of the achieved clustering, does not depend on the distribution of the cluster´s size. Finally, no threshold value is required for achieving the clustering as hierarchical clustering techniques demand. All these features manifest themselves in the important problem of clustering homologous proteins when only sequence information is available. The proposed method is tested against well known and widely used techniques, by conducting a number of experiments based on both artificial and real protein data sets and all above mentioned characteristics were confirmed.
Keywords :
biology computing; genomics; graph theory; iterative methods; pattern clustering; proteins; sequences; artificial protein data sets; hierarchical clustering techniques; homologous proteins; iterative graph based clustering method; partitional clustering techniques; real protein data sets; sequence information; Clustering algorithms; Clustering methods; Couplings; Histograms; Mutual information; Proteins; Sociology;
Conference_Titel :
Bioinformatics & Bioengineering (BIBE), 2012 IEEE 12th International Conference on
Conference_Location :
Larnaca
Print_ISBN :
978-1-4673-4357-2
DOI :
10.1109/BIBE.2012.6399734