DocumentCode :
110620
Title :
Domain Adaptation in Semantic Role Labeling Using a Neural Language Model and Linguistic Resources
Author :
Quynh Thi Ngoc Do ; Bethard, Steven ; Moens, Marie-Francine
Author_Institution :
Dept. of Comput. Sci., Katholieke Univ. Leuven, Leuven, Belgium
Volume :
23
Issue :
11
fYear :
2015
fDate :
Nov. 2015
Firstpage :
1812
Lastpage :
1823
Abstract :
We propose a method for adapting Semantic Role Labeling (SRL) systems from a source domain to a target domain by combining a neural language model and linguistic resources to generate additional training examples. We primarily aim to improve the results of Location, Time, Manner and Direction roles. In our methodology, main words of selected predicates and arguments in the source-domain training data are replaced with words from the target domain. The replacement words are generated by a language model and then filtered by several linguistic filters (including Part-Of-Speech (POS), WordNet and Predicate constraints). In experiments on the out-of-domain CoNLL 2009 data, with the Recurrent Neural Network Language Model (RNNLM) and a well-known semantic parser from Lund University, we show enhanced recall and F1 without penalizing precision on the four targeted roles. These results improve the results of the same SRL system without using the language model and the linguistic resources, and are better than the results of the same SRL system that is trained with examples that are enriched with word embeddings. We also demonstrate the importance of using a language model and the vocabulary of the target domain when generating new training examples.
Keywords :
computational linguistics; recurrent neural nets; POS; Predicate constraints; RNNLM; SRL systems; WordNet; domain adaptation; language model; linguistic filters; linguistic resources; neural language model; out-of-domain CoNLL 2009 data; part-of-speech; recurrent neural network language model; semantic parser; semantic role labeling; source domain; source-domain training data; target domain; word embeddings; IEEE transactions; Labeling; Pragmatics; Semantics; Syntactics; Training; Training data; Language model; linguistic resources; open domain; semantic role labeling;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2449072
Filename :
7131482
Link To Document :
بازگشت