DocumentCode :
4243
Title :
A Survival Modeling Approach to Biomedical Search Result Diversification Using Wikipedia
Author :
Xiaoshi Yin ; Huang, Jimmy Xiangji ; Zhoujun Li ; Xiaofeng Zhou
Author_Institution :
Sch. of Comput. Sci., Beihang Univ., Beijing, China
Volume :
25
Issue :
6
fYear :
2013
fDate :
Jun-13
Firstpage :
1201
Lastpage :
1212
Abstract :
In this paper, we propose a survival modeling approach to promoting ranking diversity for biomedical information retrieval. The proposed approach concerns with finding relevant documents that can deliver more different aspects of a query. First, two probabilistic models derived from the survival analysis theory are proposed for measuring aspect novelty. Second, a new method using Wikipedia to detect aspects covered by retrieved documents is presented. Third, an aspect filter based on a two-stage model is introduced. It ranks the detected aspects in decreasing order of the probability that an aspect is generated by the query. Finally, the relevance and the novelty of retrieved documents are combined at the aspect level for reranking. Experiments conducted on the TREC 2006 and 2007 Genomics collections demonstrate the effectiveness of the proposed approach in promoting ranking diversity for biomedical information retrieval. Moreover, we further evaluate our approach in the Web retrieval environment. The evaluation results on the ClueWeb09-T09B collection show that our approach can achieve promising performance improvements.
Keywords :
Web sites; document handling; genomics; information retrieval; medical information systems; ClueWeb09-T09B collection; TREC; Web retrieval environment; Wikipedia; biomedical information retrieval; biomedical search result diversification; genomics collections; probabilistic models; ranking diversity; retrieved documents; survival analysis theory; survival modeling approach; Bioinformatics; Electronic publishing; Encyclopedias; Genomics; Internet; Survival modeling; biomedical IR; diversity; rerank; thesaurus;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2012.24
Filename :
6152103
Link To Document :
بازگشت