• DocumentCode
    3466692
  • Title

    Towards Large-scale High-Performance English Verb Sense Disambiguation by Using Linguistically Motivated Features

  • Author

    Chen, Jinying ; Dligach, Dmitriy ; Palmer, Martha

  • Author_Institution
    BBN Technol., Cambridge
  • fYear
    2007
  • fDate
    17-19 Sept. 2007
  • Firstpage
    378
  • Lastpage
    388
  • Abstract
    In this paper we describe the results of training high performance word sense disambiguation (WSD) systems on a new data set based on groupings of WordNet senses. This data set is designed to provide clear sense distinctions with sufficient examples in order to provide high quality training data. The sense distinctions are based on explicit syntactic and semantic criteria. Our WSD features utilize similar syntactic and semantic linguistic information. We demonstrate that this approach, using both maximum entropy and SVM models, produces systems whose performance is comparable to that of humans.
  • Keywords
    maximum entropy methods; natural language processing; support vector machines; word processing; SVM models; WordNet senses; large-scale high-performance English verb sense disambiguation; linguistically motivated features; maximum entropy; semantic linguistic information; syntactic linguistic information; word sense disambiguation; Computer science; Data mining; Entropy; High performance computing; Humans; Large-scale systems; Natural languages; Support vector machines; System performance; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing, 2007. ICSC 2007. International Conference on
  • Conference_Location
    Irvine, CA
  • Print_ISBN
    978-0-7695-2997-4
  • Type

    conf

  • DOI
    10.1109/ICSC.2007.69
  • Filename
    4338372