DocumentCode
110620
Title
Domain Adaptation in Semantic Role Labeling Using a Neural Language Model and Linguistic Resources
Author
Quynh Thi Ngoc Do ; Bethard, Steven ; Moens, Marie-Francine
Author_Institution
Dept. of Comput. Sci., Katholieke Univ. Leuven, Leuven, Belgium
Volume
23
Issue
11
fYear
2015
fDate
Nov. 2015
Firstpage
1812
Lastpage
1823
Abstract
We propose a method for adapting Semantic Role Labeling (SRL) systems from a source domain to a target domain by combining a neural language model and linguistic resources to generate additional training examples. We primarily aim to improve the results of Location, Time, Manner and Direction roles. In our methodology, main words of selected predicates and arguments in the source-domain training data are replaced with words from the target domain. The replacement words are generated by a language model and then filtered by several linguistic filters (including Part-Of-Speech (POS), WordNet and Predicate constraints). In experiments on the out-of-domain CoNLL 2009 data, with the Recurrent Neural Network Language Model (RNNLM) and a well-known semantic parser from Lund University, we show enhanced recall and F1 without penalizing precision on the four targeted roles. These results improve the results of the same SRL system without using the language model and the linguistic resources, and are better than the results of the same SRL system that is trained with examples that are enriched with word embeddings. We also demonstrate the importance of using a language model and the vocabulary of the target domain when generating new training examples.
Keywords
computational linguistics; recurrent neural nets; POS; Predicate constraints; RNNLM; SRL systems; WordNet; domain adaptation; language model; linguistic filters; linguistic resources; neural language model; out-of-domain CoNLL 2009 data; part-of-speech; recurrent neural network language model; semantic parser; semantic role labeling; source domain; source-domain training data; target domain; word embeddings; IEEE transactions; Labeling; Pragmatics; Semantics; Syntactics; Training; Training data; Language model; linguistic resources; open domain; semantic role labeling;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2015.2449072
Filename
7131482
Link To Document