DocumentCode :
1338858
Title :
Answering General Time-Sensitive Queries
Author :
Dakka, Wisam ; Gravano, Luis ; Ipeirotis, Panagiotis G.
Author_Institution :
Google, Mountain View, CA, USA
Volume :
24
Issue :
2
fYear :
2012
Firstpage :
220
Lastpage :
235
Abstract :
Time is an important dimension of relevance for a large number of searches, such as over blogs and news archives. So far, research on searching over such collections has largely focused on locating topically similar documents for a query. Unfortunately, topic similarity alone is not always sufficient for document ranking. In this paper, we observe that, for an important class of queries that we call time-sensitive queries, the publication time of the documents in a news archive is important and should be considered in conjunction with the topic similarity to derive the final document ranking. Earlier work has focused on improving retrieval for “recency” queries that target recent documents. We propose a more general framework for handling time-sensitive queries and we automatically identify the important time intervals that are likely to be of interest for a query. Then, we build scoring techniques that seamlessly integrate the temporal aspect into the overall ranking mechanism. We present an extensive experimental evaluation using a variety of news article data sets, including TREC data as well as real web data analyzed using the Amazon Mechanical Turk. We examine several techniques for detecting the important time intervals for a query over a news archive and for incorporating this information in the retrieval process. We show that our techniques are robust and significantly improve result quality for time-sensitive queries compared to state-of-the-art retrieval techniques.
Keywords :
document handling; query processing; document ranking; information retrieval process; news archive; news article data set; recency query; scoring technique; time-sensitive query; topic similarity; Computational modeling; Google; Histograms; Information retrieval; Mathematical model; Query processing; Search methods; Smoothing methods; Time frequency analysis; Information search and retrieval; processing time-sensitive queries; time-sensitive search.;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2010.187
Filename :
5590248
Link To Document :
بازگشت