Title of article :
Exploiting the Web as the Multilingual Corpus
for Unknown Query Translation
Author/Authors :
Jenq-Haur Wang، نويسنده , , Jei-Wen Teng، نويسنده , , and Wen-Hsiang Lu، نويسنده , , Lee-Feng Chien، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2006
Abstract :
Users’ cross-lingual queries to a digital library system
might be short and the query terms may not be included
in a common translation dictionary (unknown terms). In
this article, the authors investigate the feasibility of
exploiting the Web as the multilingual corpus source to
translate unknown query terms for cross-language information
retrieval in digital libraries. They propose a Webbased
term translation approach to determine effective
translations for unknown query terms by mining bilingual
search-result pages obtained from a real Web
search engine. This approach can enhance the construction
of a domain-specific bilingual lexicon and
bring multilingual support to a digital library that only
has monolingual document collections. Very promising
results have been obtained in generating effective translation
equivalents for many unknown terms, including
proper nouns, technical terms, and Web query terms,
and in assisting bilingual lexicon construction for a real
digital library system.
Journal title :
Journal of the American Society for Information Science and Technology
Journal title :
Journal of the American Society for Information Science and Technology