Title :
Structural similarity computation based on extended edge matching method
Author :
Yang, Ting ; Wei, Jinmao ; Fan, Baoquan ; Wang, Xu ; Zhang, Haiwei
Author_Institution :
Coll. of Inf. Tech. Sci., Nankai Univ., Tianjin, China
Abstract :
Similarity measurement of hierarchically structured data is crucial in the data mining, database and information retrieval communities. In this paper, we propose an extended edge matching method for better evaluating structural similarity between XML(eXtensible Markup Language) documents. The proposed method not only generates edges between parent nodes and child nodes, but also generates topological edges between ancestor nodes and descendant nodes. Furthermore, complete, topological and repeated matchings are distinguished in the process of edge matching. When one edge matches another, the method can identify the type of matching, and assign proper weight to it. Experiments demonstrated that the proposed method generated better similarity results and clustering results in comparison with some existing methods.
Keywords :
XML; data mining; edge detection; information retrieval; pattern clustering; pattern matching; topology; XML documents; child nodes; data mining; database retrieval communities; eXtensible markup language documents; extended edge matching method; information retrieval communities; parent nodes; similarity measurement; structural similarity computation; topological edges generation; Clustering algorithms; Complexity theory; Data mining; Educational institutions; Image edge detection; Pattern matching; XML;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6233716