Title of article :
Effect of Pronoun Resolution on Document Similarity
Author/Authors :
Atul Kumar (author), Sudip Sanyal (author)
Issue Information :
Journal with serial number, year 2010
Pages :
5
From page :
60
To page :
64
Abstract :
This paper presents a novel study of the effect of pronoun resolution on the measurement of document similarity, within the framework of the Vector Space Model (VSM) and Probabilistic Latent Semantic Analysis (PLSA). For this purpose we developed a benchmark corpus of documents whose similarity scores were assigned by human judges. We measured inter-document similarity on this corpus using VSM and PLSA, then performed pronoun resolution on the documents and recalculated the similarity with both methods. Finally, we computed the correlation coefficient between each set of computed scores and the human-generated scores. The correlation coefficients showed substantial and consistent improvement in the similarity scores after pronoun resolution.
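The evaluation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes raw term-frequency weighting and whitespace tokenization for the VSM step, resolves the pronoun by hand in a toy sentence, and omits PLSA entirely. The function names (`tf_vector`, `cosine_similarity`, `pearson`) and the example sentences are hypothetical.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector (assumed weighting: raw counts)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


# Toy documents (hypothetical); the pronoun "he" is resolved by hand.
doc_a = "john likes apples he eats apples"
doc_a_resolved = "john likes apples john eats apples"
doc_b = "john eats apples"

sim_before = cosine_similarity(tf_vector(doc_a), tf_vector(doc_b))
sim_after = cosine_similarity(tf_vector(doc_a_resolved), tf_vector(doc_b))
```

With a full corpus, one would collect the computed scores for every document pair, before and after resolution, and pass each list to `pearson` together with the human-assigned scores to compare the two correlation coefficients.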
Keywords :
Document Similarity, Information Retrieval, Statistical Algorithm, Pronoun Resolution
Journal title :
International Journal of Computer Applications
Serial Year :
2010
Record number :
659499