Title :
A Novel Language Model Based on Cognition Attention Attenuation in Web Retrieval
Author :
Cao, Donglin ; Xu, Hongbo ; Bai, Shuo ; Cheng, Xueqi ; Li, Shaozi
Author_Institution :
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing
Abstract :
Language models are widely used in retrieval systems. Their document representation rests on the bag-of-words assumption: every term in a document is treated as an equal object, and term frequency is the only evidence of a term's importance. In this paper, we study the problem of cognition attention attenuation in document processing and present a language model based on it. The model estimates the document model from the attenuation process of terms in the document. Compared with the classical language model, its advantage is that it takes into account document structure, which is often used in text summarization. In our experiments, the cognition attention attenuation based language model outperformed the classical language model with Dirichlet smoothing on blog pages and Web pages.
Keywords :
Internet; cognition; computational linguistics; document handling; information retrieval; Web retrieval; bag-of-word assumption; cognition attention attenuation; document processing; document representation; language model; Attenuation; Cognition; Cognitive science; Computers; Couplings; Frequency; Hidden Markov models; Intelligent agent; Natural languages; Smoothing methods;
Conference_Title :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.19