• DocumentCode
    2699003
  • Title

    Iterative Mining Translations from the Web

  • Author

    Li, Fang ; Yuan, Shuangqing ; Sheng, Huanye

  • Author_Institution
    Dept. of Computer Science and Engineering, Shanghai Jiao Tong University
  • fYear
    2005
  • fDate
    08-09 April 2005
  • Firstpage
    12
  • Lastpage
    16
  • Abstract
    Multilingual translations play a vital role multilingual or cross-lingual information retrieval and extraction. In this paper, we describe a new method mine translations from bilingual web pages based on our former research. Two new features are introduced, one is the iterative mining process in order to increase the number of translation pairs; the other is the filtering step which deletes language-specific prefix and postfix in hyperlinks. Experiments show that the precision has been greatly improved due to the filtering step and the number of translation pairs increased after six iterations.
  • Keywords
    Computer science; Data mining; Humans; IP networks; Information filtering; Information filters; Information retrieval; Iterative methods; Uniform resource locators; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Information Retrieval and Integration, 2005. WIRI '05. Proceedings. International Workshop on Challenges in
  • Print_ISBN
    0-7695-2414-1
  • Type

    conf

  • DOI
    10.1109/WIRI.2005.24
  • Filename
    1552990