DocumentCode :
1844235
Title :
Research on Query Translation Disambiguation for CLIR Based on HowNet
Author :
Zhu, Honglei ; Zheng, Dequan ; Zhao, Tiejun
Author_Institution :
MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin
fYear :
2008
fDate :
18-21 Nov. 2008
Firstpage :
1677
Lastpage :
1682
Abstract :
Query translation is an important task for cross-language information retrieval (CLIR), which aims at translating the query described in source language into target language. The approach to query translation based on bilingual dictionary is becoming the mainstream thinking because of its simplicity and the increasing availability of machine readable bilingual dictionary. However, this kind of approach faces two necessary problems that is ambiguity in translation and the incompleteness of the dictionary. This paper focuses on the first problem, and it presents three statistical models based on HowNet to resolve query translation ambiguity of CLIR: query translation selection based on semantic relation; bilingual decaying co-occurrence model and semantic decaying co-occurrence model. Through test and summarizing this paper gets the best algorithm to integrate the traits of the three models, which gradually filters and optimizes the translation.
Keywords :
dictionaries; information retrieval; language translation; statistical analysis; HowNet; bilingual decaying co-occurrence model; cross-language information retrieval; machine readable bilingual dictionary; query translation disambiguation; semantic decaying co-occurrence model; statistical model; Availability; Dictionaries; Filters; Information retrieval; Laboratories; Natural language processing; Natural languages; Speech processing; Statistical analysis; Testing; CLIR; OOV; Query translation; statistical method; translation selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Young Computer Scientists, 2008. ICYCS 2008. The 9th International Conference for
Conference_Location :
Hunan
Print_ISBN :
978-0-7695-3398-8
Electronic_ISBN :
978-0-7695-3398-8
Type :
conf
DOI :
10.1109/ICYCS.2008.19
Filename :
4709225
Link To Document :
بازگشت