DocumentCode :
1415376
Title :
Efficient Fuzzy Type-Ahead Search in XML Data
Author :
Feng, Jianhua ; Li, Guoliang
Author_Institution :
Dept. of Comput. Sci., Tsinghua Univ., Beijing, China
Volume :
24
Issue :
5
fYear :
2012
fDate :
5/1/2012 12:00:00 AM
Firstpage :
882
Lastpage :
895
Abstract :
In a traditional keyword-search system over XML data, a user composes a keyword query, submits it to the system, and retrieves relevant answers. A user with limited knowledge of the data often feels “left in the dark” when issuing queries and has to use a try-and-see approach to find information. In this paper, we study fuzzy type-ahead search in XML data, a new information-access paradigm in which the system searches XML data on the fly as the user types in query keywords. It allows users to explore data as they type, even in the presence of minor errors in their keywords. Our proposed method has the following features: 1) Search as you type: It extends Autocomplete by supporting queries with multiple keywords in XML data. 2) Fuzzy: It can find high-quality answers that have keywords matching query keywords approximately. 3) Efficient: Our index structures and searching algorithms achieve a very high interactive speed. We study the research challenges in this new search framework. We propose effective index structures and top-k algorithms to achieve a high interactive speed. We examine effective ranking functions and early termination techniques to progressively identify the top-k relevant answers. We have implemented our method on real data sets, and the experimental results show that our method achieves high search efficiency and result quality.
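The "search as you type" and "fuzzy" features described in the abstract can be illustrated with a minimal sketch. It is not the paper's index structure or top-k algorithm: it is a brute-force scan that assumes a keyword matches a word when some prefix of the word is within a small edit distance of the keyword. The function names, the `max_errors` threshold, and the sample XML are all hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET


def prefix_edit_distance(query: str, word: str) -> int:
    """Smallest edit distance between `query` and any prefix of `word`."""
    # prev[j] holds the edit distance between query[:i] and word[:j].
    prev = list(range(len(word) + 1))
    for i, qc in enumerate(query, start=1):
        curr = [i]
        for j, wc in enumerate(word, start=1):
            cost = 0 if qc == wc else 1
            curr.append(min(prev[j] + 1,          # delete a query character
                            curr[j - 1] + 1,      # insert a word character
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    # Taking the minimum over the last row considers every prefix of `word`.
    return min(prev)


def fuzzy_type_ahead(xml_text: str, query: str, max_errors: int = 1) -> list:
    """Return ids of child elements matching every keyword, best match first.

    Assumption: each child of the root is one candidate answer, and its
    score is the sum of the per-keyword prefix edit distances.
    """
    root = ET.fromstring(xml_text)
    keywords = query.lower().split()
    results = []
    for elem in root:
        words = " ".join(elem.itertext()).lower().split()
        total = 0
        for kw in keywords:
            best = min((prefix_edit_distance(kw, w) for w in words),
                       default=max_errors + 1)
            if best > max_errors:
                break  # this keyword has no close enough match
            total += best
        else:
            results.append((total, elem.get("id")))
    results.sort(key=lambda r: r[0])  # stable sort keeps document order on ties
    return [elem_id for _, elem_id in results]


# Hypothetical sample data, not taken from the paper's data sets.
SAMPLE = """<papers>
  <paper id="p1"><title>Fuzzy type-ahead search in XML data</title></paper>
  <paper id="p2"><title>Keyword search over relational databases</title></paper>
  <paper id="p3"><title>Indexing semistructured documents</title></paper>
</papers>"""

print(fuzzy_type_ahead(SAMPLE, "serch xm"))
```

Calling `fuzzy_type_ahead` again after each keystroke imitates type-ahead behavior: the misspelled, partially typed query `serch xm` still reaches the first paper. A linear scan like this would not reach interactive speed on large data, which is the gap the paper's index structures and early-termination techniques are meant to close.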
Keywords :
XML; fuzzy set theory; query processing; user interfaces; XML data; extensible markup language; fuzzy type-ahead search; index structure; information-access paradigm; keyword query; keyword-search system; searching algorithm; top-k algorithm; top-k relevant answers; try-and-see approach; user query; Browsers; Indexes; Keyword search; Mice; Microwave integrated circuits; Servers; fuzzy search; type-ahead search
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2010.264
Filename :
5677525