DocumentCode :
2709043
Title :
DECK: Detecting Events from Web Click-Through Data
Author :
Chen, Ling ; Hu, Yiqun ; Nejdl, Wolfgang
Author_Institution :
L3S Res. Center, Univ. of Hannover, Hannover
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
123
Lastpage :
132
Abstract :
In the past few years, there has been increased research interest in detecting previously unidentified events from Web resources. Our focus in this paper is to detect events from the click-through data generated by Web search engines. Existing event detection algorithms, which mainly study the news archive data, cannot be employed directly because of the following two unique features of click-through data: 1) the information provided by click-through data is quite limited; 2) not every query issued to a Web search engine corresponds to an event in the real world. In this paper, we address this problem by proposing an effective algorithm which Detects Events from ClicK-through data DECK. We firstly transform click-through data to the 2D polar space by considering the semantic dimension and temporal dimension of queries. Robust subspace estimation is performed to detect subspaces such that each subspace consists of queries of similar semantics. Next, we prune uninteresting subspaces which do not contain queries corresponding to real events by simultaneously considering the respective distribution of queries along the semantic dimension and the temporal dimension in each subspace. Finally, events are detected from interesting subspaces using a nonparametric clustering technique. Compared with an existing approach, our experimental results based on real-life data have shown that the proposed approach is more accurate and effective in detecting real events from click-through data.
Keywords :
Internet; data mining; pattern clustering; search engines; 2D polar space; DECK; Web click-through data; Web search engine; detects event from click-through data; nonparametric clustering technique; semantic dimension; subspace estimation; temporal dimension; Data engineering; Data mining; Data preprocessing; Event detection; Internet; Publishing activities; Robustness; Search engines; Uniform resource locators; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.78
Filename :
4781107
Link To Document :
بازگشت