Title :
Cross-language information retrieval via dictionary-based and statistics-based methods
Author :
Sadat, Fatiha ; Maeda, Akira ; Yoshikawa, Masatoshi ; UEMURA, Shunsuke
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
fDate :
6/23/1905 12:00:00 AM
Abstract :
As Internet resources become accessible to more and more countries, there is a need to develop methods for Cross Language Information Retrieval for different languages. In this paper, we focus on dictionary-based approach by using a bilingual dictionary, with a combination to statistics-based methods to avoid the problem of ambiguity. Interactive feedback loops are integrated, in the task of query expansion before and after the disambiguation of the translated candidates. In this study, we propose three sorts of query expansions to improve the effectiveness of information retrieval and to dramatically reduce the errors such an approach normally makes: an Interactive Relevance Feedback, a Domain Feedback and a Similarity Thesaurus. We applied these methods to an English-French Cross-Language Information Retrieval. In terms of average precision, a 91.95% and 99.13% of the monolingual counterpart was achieved for different combinations
Keywords :
dictionaries; language translation; relevance feedback; thesauri; English-French cross-language information retrieval; Internet resources; bilingual dictionary; cross-language information retrieval; dictionary-based methods; domain feedback; interactive feedback loops; interactive relevance feedback; query expansions; similarity thesaurus; statistics-based methods; training corpora; Dictionaries; Electronic mail; Feedback loop; Informatics; Information retrieval; Information science; Internet; Search engines; Statistics; Thesauri;
Conference_Titel :
Communications, Computers and signal Processing, 2001. PACRIM. 2001 IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
0-7803-7080-5
DOI :
10.1109/PACRIM.2001.953703