DocumentCode :
3268901
Title :
Substructure clustering on sequential 3d object datasets
Author :
Tan, Zhenqiang ; Tung, Anthony K H
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore
fYear :
2004
fDate :
30 March-2 April 2004
Firstpage :
634
Lastpage :
645
Abstract :
We look at substructure clustering of sequential 3d objects. A sequential 3d object is a set of points located in a three dimensional space that are linked up to form a sequence. Given a set of sequential 3d objects, our aim is to find significantly large substructures which are present in many of the sequential 3d objects. Unlike traditional subspace clustering methods in which objects are compared based on values in the same dimension, the matching dimensions between two 3d sequential objects are affected by both the translation and rotation of the objects and are thus not well defined. Instead, similarity between the objects are judge by computing a structural distance measurement call rmsd (Root Mean Square Distance) which require proper alignment (including translation and rotation) of the objects. As the computation of rmsd is expensive, we proposed a new measure call ald (Angle Length Distance) which is shown experimentally to approximate rmsd. Based on ald, we define a new clustering model called sCluster and devise an algorithm for discovering all maximum sCluster in a 3d sequential dataset. Experiments are conducted to illustrate the efficiency and effectiveness of our algorithm.
Keywords :
computational geometry; data mining; pattern clustering; statistical analysis; angle length distance; root mean square distance; sCluster clustering model; sequential 3d object dataset; structural distance measurement; subspace clustering methods; substructure clustering; Amino acids; Clustering algorithms; Clustering methods; Data mining; Distance measurement; Drugs; Molecular biophysics; Organisms; Protein engineering; Tree graphs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2004. Proceedings. 20th International Conference on
ISSN :
1063-6382
Print_ISBN :
0-7695-2065-0
Type :
conf
DOI :
10.1109/ICDE.2004.1320033
Filename :
1320033
Link To Document :
بازگشت