• DocumentCode
    2028656
  • Title

    Automatic Creation of N-lingual Synonymous Word Sets

  • Author

    Wu, YYanchen ; Li, Fang ; Tanaka, ZRie ; Ishida, Toru

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2008
  • fDate
    3-5 Dec. 2008
  • Firstpage
    141
  • Lastpage
    148
  • Abstract
    Multilingual dictionaries are very useful in machine translations and natural language processing. However,a multilingual dictionary including all natural languages still does not exist. In this paper we propose a trustworthy method to automatically create multilingual dictionary represented by N-lingual synonymous word sets (N-tuples, hereafter). Based on the work of 3-lingual synonymous word sets, our method has extended 3-lingual to n-lingual synonymous word sets from multiple bilingual dictionaries. By matching and combining the triples instead of the binary relations in the bilingual dictionaries,the complexity of the problem is significantly reduced. Using this method, we created 4-lingual synonymous word sets among Chinese, Japanese, English and German. The evaluations indicate that our combining algorithm has effectively solved the error accumulation problem and achieved a very promising quality. In the example application, the 4-tuples are used to refine the translation quality of a multi-hop machine translator created on the language grid. It shows that utilizing the handy online services and uniform platform in research work is a good methodology.
  • Keywords
    dictionaries; language translation; natural language processing; natural languages; security of data; Chinese; English; German; Japanese; N-lingual synonymous word sets; error accumulation problem; machine translations; multihop machine translator; multilingual dictionary; natural language processing; trustworthy method; Computer science; Data mining; Dictionaries; Informatics; Joining processes; Knowledge engineering; Laboratories; Natural language processing; Natural languages; Technological innovation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantics, Knowledge and Grid, 2008. SKG '08. Fourth International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-0-7695-3401-5
  • Electronic_ISBN
    978-0-7695-3401-5
  • Type

    conf

  • DOI
    10.1109/SKG.2008.22
  • Filename
    4725907