DocumentCode :
1556628
Title :
Efficient Multidimensional Fuzzy Search for Personal Information Management Systems
Author :
Wang, Wei ; Peery, Christopher ; Marian, Am\\élie ; Nguyen, Thu D.
Author_Institution :
Rutgers University, Piscataway
Volume :
24
Issue :
9
fYear :
2012
Firstpage :
1584
Lastpage :
1597
Abstract :
With the explosion in the amount of semistructured data users access and store in personal information management systems, there is a critical need for powerful search tools to retrieve often very heterogeneous data in a simple and efficient way. Existing tools typically support some IR-style ranking on the textual part of the query, but only consider structure (e.g., file directory) and metadata (e.g., date, file type) as filtering conditions. We propose a novel multidimensional search approach that allows users to perform fuzzy searches for structure and metadata conditions in addition to keyword conditions. Our techniques individually score each dimension and integrate the three dimension scores into a meaningful unified score. We also design indexes and algorithms to efficiently identify the most relevant files that match multidimensional queries. We perform a thorough experimental evaluation of our approach and show that our relaxation and scoring framework for fuzzy query conditions in noncontent dimensions can significantly improve ranking accuracy. We also show that our query processing strategies perform and scale well, making our fuzzy search approach practical for every day usage.
Keywords :
Indexing; Information management; Information retrieval; Optimization; Query processing; XML; Information retrieval; multidimensional search; personal information management system; query processing;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.126
Filename :
5887334
Link To Document :
بازگشت