DocumentCode :
2034189
Title :
A new indexing strategy for XML keyword search
Author :
Xiang, Yongqing ; Deng, Zhihong ; Yu, Hang ; Wang, Sijing ; Gao, Ning
Author_Institution :
Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
Volume :
5
fYear :
2010
fDate :
10-12 Aug. 2010
Firstpage :
2412
Lastpage :
2416
Abstract :
With the rapid increase of XML documents on the web, how to index, store and retrieve these documents has become a very popular and valuable problem. At present, there are two normal ways of retrieving XML documents. One is structure-based retrieval; the other is keyword-based retrieval. However, XML keyword search is becoming more and more popular because it is easy to master and manipulate. In XML keyword search system, a key problem is how to store the structure information into XML indices efficiently. At present, Dewey numbers are often used to label XML nodes in XML indices. However, Dewey numbers may lead to redundancy in XML indices. In this paper, we propose a new labeling method called LAF numbers for XML indices and we device a new indexing structure called Two-Layer index for XML keyword retrieval systems. At last, we have conducted an extensive experimental study and the experimental results show that our indexing method achieves better space efficiency than prevailing Dewey-number-based indexing method.
Keywords :
XML; digital arithmetic; indexing; information retrieval systems; redundancy; semantic Web; Dewey numbers; LAF numbers; XML documents; document retrieval; indexing; keyword retrieval systems; keyword search system; redundancy; structure based retrieval; two-layer index; web; Encoding; Indexing; Keyword search; Labeling; Redundancy; XML; Dewey numbers; Indexing; LAF numbers; Two-Layer; XML keyword Search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
Type :
conf
DOI :
10.1109/FSKD.2010.5569522
Filename :
5569522
Link To Document :
بازگشت