DocumentCode
2699003
Title
Iterative Mining Translations from the Web
Author
Li, Fang ; Yuan, Shuangqing ; Sheng, Huanye
Author_Institution
Dept. of Computer Science and Engineering, Shanghai Jiao Tong University
fYear
2005
fDate
08-09 April 2005
Firstpage
12
Lastpage
16
Abstract
Multilingual translations play a vital role multilingual or cross-lingual information retrieval and extraction. In this paper, we describe a new method mine translations from bilingual web pages based on our former research. Two new features are introduced, one is the iterative mining process in order to increase the number of translation pairs; the other is the filtering step which deletes language-specific prefix and postfix in hyperlinks. Experiments show that the precision has been greatly improved due to the filtering step and the number of translation pairs increased after six iterations.
Keywords
Computer science; Data mining; Humans; IP networks; Information filtering; Information filters; Information retrieval; Iterative methods; Uniform resource locators; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Information Retrieval and Integration, 2005. WIRI '05. Proceedings. International Workshop on Challenges in
Print_ISBN
0-7695-2414-1
Type
conf
DOI
10.1109/WIRI.2005.24
Filename
1552990
Link To Document