• DocumentCode
    3300440
  • Title

    NP tree matching for English-Chinese translation of patent titles

  • Author

    Cai, Dongfeng ; Lin, Xiaoqing ; Ji, Duo ; Yang, Jie

  • Author_Institution
    Knowledge Eng. Res. Center, Shenyang Inst. of Aeronaut. Eng., Shenyang
  • fYear
    2008
  • fDate
    19-22 Oct. 2008
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    This paper proposes a method of NP tree matching to realize the translation of English-Chinese patent titles. Firstly a bilingual example database for patent titles is built. English parse trees are produced by English parser, forming NP tree database. The input patent title to be translated is firstly parsed into a tree. Then NP trees are searched for which match with the input NP tree in NP tree database. If similar NP trees exist, HowNet is used to find the best NP tree by calculating word semantic similarity. Final translations are obtained through calculating cohesion of candidate words. If there are no similar NP trees, subtrees that match the input NP tree are searched for and translations generate by subtree substitutions recursively. Experimental results show that our method outperforms a baseline Pharaoh, by using BLEU evaluation system.
  • Keywords
    grammars; language translation; natural language processing; patents; trees (mathematics); English parse trees; English-Chinese patent title; English-Chinese translation; HowNet; NP tree database; NP tree matching; Bones; Communications technology; Databases; Impedance matching; Information technology; Knowledge engineering; Mechanical variables measurement; Neural networks; Real time systems; Switches; HowNet; NP translation pattern; NP tree; Patent terms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4515-8
  • Electronic_ISBN
    978-1-4244-2780-2
  • Type

    conf

  • DOI
    10.1109/NLPKE.2008.4906759
  • Filename
    4906759