DocumentCode :
2133977
Title :
Cross-language information retrieval using Web directories
Author :
Kimura, Fuminori ; Maeda, Akira ; Yoshikawa, Masatoshi ; Uemura, Shunsuke
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
Volume :
2
fYear :
2003
fDate :
28-30 Aug. 2003
Firstpage :
911
Abstract :
Since the Web consists of documents in various domains or genres, the method for cross-language information retrieval (CLIR) of Web documents should be independent of a particular domain. In this paper, we propose a CLIR method which employs a Web directory provided in multiple language versions (such as Yahoo!). In the proposed method, feature terms are first extracted from Web documents for each category in the source and the target languages. Then, one or more corresponding categories in another language are determined beforehand by comparing similarities between categories across languages. Using these category pairs, we intend to resolve ambiguities of simple dictionary translation by narrowing the categories to be retrieved in the target language.
Keywords :
Web sites; feature extraction; information retrieval; Web directories; Web documents; cross-language information retrieval; source language; target languages; Computer science; Data mining; Dictionaries; Educational institutions; Information retrieval; Information science; Information technology; Internet; Search engines; Web search
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications, Computers and Signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
Print_ISBN :
0-7803-7978-0
Type :
conf
DOI :
10.1109/PACRIM.2003.1235931
Filename :
1235931