DocumentCode :
1659876
Title :
Search beyond Traditional Probabilistic Information Retrieval
Author :
Huang, Jimmy
Author_Institution :
York Univ., Toronto, ON, Canada
Volume :
1
fYear :
2011
Firstpage :
5
Lastpage :
5
Abstract :
Most traditional Information Retrieval models assume that query terms are independent of each other and that a document is represented as a bag of words. Nevertheless, this assumption may not hold in practice. In this talk, I will discuss how query terms associate with each other and how to incorporate term proximity information into the classical probabilistic IR models. I will discuss the relationship between a document's length and its relevance, and how to balance the Verbosity and Scope hypotheses by modeling document length within the probabilistic weighting model. I will also present how to incorporate this relationship into the classical BM25 models. Through extensive experiments on standard large-scale TREC Web collections, I will show that the extended models markedly outperform the BM25 baseline and are at least comparable to the state-of-the-art model. The talk will conclude with a discussion of novel challenges raised in extending probabilistic Information Retrieval and several applications, such as promoting diversity in ranking for biomedical IR, sentiment analysis for predicting sales performance, and EMR data analysis for effective health care.
Keywords :
information retrieval; probability; EMR data analysis; biomedical IR; classical BM25 models; classical probabilistic IR models; document length; health care; information retrieval model; large-scale TREC Web collections; probabilistic information retrieval; probabilistic weighting model; query terms; sales performance; sentiment analysis; term proximity information; Awards activities; Biological system modeling; Conferences; Educational institutions; Information retrieval; Medical services; Probabilistic logic;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
Type :
conf
DOI :
10.1109/WI-IAT.2011.289
Filename :
6040730