DocumentCode :
2735186
Title :
Cross-language information retrieval based on weight computation of query keywords translation
Author :
Zhang Xiao-fei ; Huang He-yan ; Zhang Ke-liang
Author_Institution :
Res. Center of Comput. & Language Inf. Eng., Chinese Acad. of Sci., Beijing, China
Volume :
3
fYear :
2009
fDate :
20-22 Nov. 2009
Firstpage :
253
Lastpage :
256
Abstract :
In cross-language information retrieval (CLIR), the query sentence is often combined with a series of query keywords, rather than a complete natural sentence. Lack of necessary contextual syntactic information in such a query sentence makes it impossible to achieve a unique translation of the query sentence with acceptable precision. In this paper, we convert the translation of query sentence to the weight computation of the translations of the query keyword based on large-scale bilingual parallel corpora, and thereafter reconstruct the query sentence in target language. The experimental results show that the approach achieves an average retrieval accuracy of 93.4% in the front 10 retrieval results and 89.1% in the front 100 retrieval results, while the retrieval error rate is reduced by 63.62% over the purely dictionary-based baseline.
Keywords :
natural language processing; query formulation; bilingual parallel corpora; cross-language information retrieval; query keywords translation; query sentence; translation weight computation; Computational linguistics; Concurrent computing; Error analysis; Information retrieval; Internet; Large-scale systems; Natural languages; Optical computing; Performance analysis; Testing; CLIR; query sentencee; translation of query keyword; weight computation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-4754-1
Electronic_ISBN :
978-1-4244-4738-1
Type :
conf
DOI :
10.1109/ICICISYS.2009.5358174
Filename :
5358174
Link To Document :
بازگشت