• DocumentCode
    110620
  • Title

    Domain Adaptation in Semantic Role Labeling Using a Neural Language Model and Linguistic Resources

  • Author

    Quynh Thi Ngoc Do ; Bethard, Steven ; Moens, Marie-Francine

  • Author_Institution
    Dept. of Comput. Sci., Katholieke Univ. Leuven, Leuven, Belgium
  • Volume
    23
  • Issue
    11
  • fYear
    2015
  • fDate
    Nov. 2015
  • Firstpage
    1812
  • Lastpage
    1823
  • Abstract
    We propose a method for adapting Semantic Role Labeling (SRL) systems from a source domain to a target domain by combining a neural language model and linguistic resources to generate additional training examples. We primarily aim to improve the results of Location, Time, Manner and Direction roles. In our methodology, main words of selected predicates and arguments in the source-domain training data are replaced with words from the target domain. The replacement words are generated by a language model and then filtered by several linguistic filters (including Part-Of-Speech (POS), WordNet and Predicate constraints). In experiments on the out-of-domain CoNLL 2009 data, with the Recurrent Neural Network Language Model (RNNLM) and a well-known semantic parser from Lund University, we show enhanced recall and F1 without penalizing precision on the four targeted roles. These results improve the results of the same SRL system without using the language model and the linguistic resources, and are better than the results of the same SRL system that is trained with examples that are enriched with word embeddings. We also demonstrate the importance of using a language model and the vocabulary of the target domain when generating new training examples.
  • Keywords
    computational linguistics; recurrent neural nets; POS; Predicate constraints; RNNLM; SRL systems; WordNet; domain adaptation; language model; linguistic filters; linguistic resources; neural language model; out-of-domain CoNLL 2009 data; part-of-speech; recurrent neural network language model; semantic parser; semantic role labeling; source domain; source-domain training data; target domain; word embeddings; IEEE transactions; Labeling; Pragmatics; Semantics; Syntactics; Training; Training data; Language model; linguistic resources; open domain; semantic role labeling;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2449072
  • Filename
    7131482