DocumentCode :
2826345
Title :
Content and Structure Based Approach For XML Similarity
Author :
Ma, Yanru ; Chbeir, Richard
Author_Institution :
Lab. LE2I, Bourgogne Univ., Dijon
fYear :
2005
fDate :
21-23 Sept. 2005
Firstpage :
136
Lastpage :
140
Abstract :
Since the last decade, XML has become inevitable for complex data representation. In this paper, we address a problem of measuring the similarity between XML documents and propose a new XML document similarity approach, which considers the asymmetric similarity and the similarity of both semantic content and document structure. Here, we only consider the measurement of similarity between two XML documents based on the same schema. A prototype has been implemented to validate and evaluate the performances of our proposal. We do believe that our method can also be used to evaluate the similarity of other tree-structured complex data
Keywords :
XML; content management; document handling; tree data structures; XML document similarity; data representation; semantic content similarity; tree-structured complex data; Data mining; Feature extraction; Information retrieval; Internet; MPEG 7 Standard; Performance evaluation; Proposals; Prototypes; Taxonomy; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology, 2005. CIT 2005. The Fifth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
0-7695-2432-X
Type :
conf
DOI :
10.1109/CIT.2005.91
Filename :
1562641
Link To Document :
بازگشت