Title :
Finding terminology translations from hyperlinks on the Internet
Author :
Yuan, Shuang-qing ; Li, Fang ; Sheng, Huan-Ye
Author_Institution :
Dept. of Comput. Sci., Shanghai Jiao Tong Univ., China
Abstract :
In this paper, we describe a novel method for finding terminology translations from hyperlinks between bilingual homepages on the Internet. A term and its translation are recognized according to the similarity of their hyperlinks: each hyperlink is regarded as a vector, and the similarity of two vectors is calculated with the Dice coefficient. Experimental results show that the method is reasonable and useful, and that it can be applied to any language pair and domain for multilingual information retrieval and extraction.
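The similarity step named in the abstract can be illustrated with a minimal sketch. It assumes that each hyperlink is encoded as a sparse vector mapping features to weights; the abstract does not specify the encoding, so the link-target paths, the binary weights, and the names `dice`, `zh_term`, and `en_candidates` are all hypothetical. The formula used is the common vector form of the Dice coefficient, 2(a·b) / (|a|² + |b|²), which may differ in detail from the one in the paper.

```python
def dice(a, b):
    """Dice coefficient of two sparse vectors given as {feature: weight} dicts."""
    # Dot product over the features the two vectors share.
    dot = sum(w * b[k] for k, w in a.items() if k in b)
    # Sum of the squared norms of both vectors.
    norm = sum(w * w for w in a.values()) + sum(w * w for w in b.values())
    # Two empty vectors are treated as having no similarity.
    return 2.0 * dot / norm if norm else 0.0


# Hypothetical data: a source-language anchor and two target-language
# candidates, each described by the link targets it points to.
zh_term = {"/products/router.html": 1, "/support/index.html": 1}
en_candidates = {
    "router": {"/products/router.html": 1, "/support/index.html": 1},
    "contact": {"/about/contact.html": 1},
}

# Pick the candidate whose hyperlink vector is most similar to the source term.
best = max(en_candidates, key=lambda t: dice(zh_term, en_candidates[t]))
```

With binary weights this reduces to the set form of the Dice coefficient, twice the number of shared features divided by the total number of features in both vectors.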
Keywords :
Internet; Web sites; data mining; language translation; natural languages; nomenclature; Dice coefficient; Web mining; bilingual homepages; bilingual terminology extraction; hyperlinks; language pairs; multilingual information retrieval; terminology recognition; terminology translations; unparallel corpus extraction; vector similarity; Computer science; Electronic mail; Information retrieval; Libraries; Terminology; Web pages
Conference_Title :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
DOI :
10.1109/ICMLC.2002.1176813