• DocumentCode
    1844235
  • Title

    Research on Query Translation Disambiguation for CLIR Based on HowNet

  • Author

    Zhu, Honglei ; Zheng, Dequan ; Zhao, Tiejun

  • Author_Institution
    MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin
  • fYear
    2008
  • fDate
    18-21 Nov. 2008
  • Firstpage
    1677
  • Lastpage
    1682
  • Abstract
    Query translation is an important task for cross-language information retrieval (CLIR), which aims at translating the query described in source language into target language. The approach to query translation based on bilingual dictionary is becoming the mainstream thinking because of its simplicity and the increasing availability of machine readable bilingual dictionary. However, this kind of approach faces two necessary problems that is ambiguity in translation and the incompleteness of the dictionary. This paper focuses on the first problem, and it presents three statistical models based on HowNet to resolve query translation ambiguity of CLIR: query translation selection based on semantic relation; bilingual decaying co-occurrence model and semantic decaying co-occurrence model. Through test and summarizing this paper gets the best algorithm to integrate the traits of the three models, which gradually filters and optimizes the translation.
  • Keywords
    dictionaries; information retrieval; language translation; statistical analysis; HowNet; bilingual decaying co-occurrence model; cross-language information retrieval; machine readable bilingual dictionary; query translation disambiguation; semantic decaying co-occurrence model; statistical model; Availability; Dictionaries; Filters; Information retrieval; Laboratories; Natural language processing; Natural languages; Speech processing; Statistical analysis; Testing; CLIR; OOV; Query translation; statistical method; translation selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Young Computer Scientists, 2008. ICYCS 2008. The 9th International Conference for
  • Conference_Location
    Hunan
  • Print_ISBN
    978-0-7695-3398-8
  • Electronic_ISBN
    978-0-7695-3398-8
  • Type

    conf

  • DOI
    10.1109/ICYCS.2008.19
  • Filename
    4709225