Title :
LDA based PSEUDO relevance feedback for cross language information retrieval
Author :
Xuwen Wang ; Qiang Zhang ; Xiaojie Wang ; Yueping Sun
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
fDate :
Oct. 30 2012-Nov. 1 2012
Abstract :
This paper introduced a LDA-based pseudo relevance feedback (PRF) model for cross language information retrieval. To validate the performance of PRF techniques in CLIR task, we conducted cross language query expansion experiments based on a self-constructed CLIR system, the LDA-based PRF model was applied before or after the query translating process, namely the pre-translation-PRF, the post-translation-PRF, and the combined-PRF strategy. We also compared this model with the classical VSM-based PRF algorithm. Experiment results showed that the proposed LDA-based PRF method was effective for improving the performance of CLIR.
Keywords :
natural language processing; query processing; relevance feedback; vectors; CLIR task; LDA based pseudo relevance feedback; PRF technique performance; combined-PRF strategy; cross language information retrieval; cross language query expansion experiments; post-translation-PRF; pretranslation-PRF; query translating process; self-constructed CLIR system; Approximation methods; Data models; Google; Information retrieval; Research and development; Resource management; Vectors; PSEUDO relevance feed back; cross language information retrieval; latent dirichlet allocation (LDA); query expansion; vector space model (VSM);
Conference_Titel :
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-1855-6
DOI :
10.1109/CCIS.2012.6664637