DocumentCode
3309147
Title
Semi-supervised learning for word sense disambiguation using parallel corpora
Author
Mo Yu ; Shu Wang ; Conghui Zhu ; Tiejun Zhao
Author_Institution
MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin, China
Volume
3
fYear
2011
fDate
26-28 July 2011
Firstpage
1490
Lastpage
1494
Abstract
The Application of word sense disambiguation (WSD) methods based on supervised machine learning are limited by the difficulties in defining sense tags and acquiring labeled data for training. In this paper, the two problems of WSD are solved in a semi-supervised learning framework with the help of parallel corpora. The sense tags are defined automatically according to the results of word alignment on the parallel corpora. And label propagation, a graph-based semi-supervised algorithm, is employed. The experiments show that our method achieves great improvement on Chinese WSD tasks and the performances get significant growth when the scale of monolingual sentences is increasing.
Keywords
computational linguistics; learning (artificial intelligence); monolingual sentences; parallel corpora; semisupervised learning; supervised machine learning; word alignment; word sense disambiguation; Accuracy; Computational linguistics; Computers; Conferences; Support vector machines; Training; Training data; label propagation; parallel corpora; semi-supervised learning; word sense disambiguation;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-61284-180-9
Type
conf
DOI
10.1109/FSKD.2011.6019785
Filename
6019785
Link To Document