Title :
The impact of collocational features in Turkish Word Sense Disambiguation
Author :
Bahar İlgen;Eşref Adali;A. Cüneyd Tantuğ
Author_Institution :
Computer and Informatics Faculty, Istanbul Technical University, 34469, Maslak, Turkey
fDate :
6/1/2012 12:00:00 AM
Abstract :
Word Sense Disambiguation (WSD) is the task of choosing the most appropriate sense of a word having multiple senses in a given context. Collocational features acquired from the words in neighborship with the ambiguous word are one of the important knowledge sources in this area. This paper explores the effective sets of collocational features in Turkish in order to obtain better Turkish WSD systems. A lexical sample dataset of highly polysemous nouns and verbs has been prepared as the initial step of the work. Several supervised learning algorithms have been tested on this data by supplying different feature sets to select the best performing features for both nouns and verbs in Turkish. Also, we investigated the impact of several collocational features of polysemous words and evaluated the performance of several supervised machine learning algorithms.
Keywords :
"Accuracy","Dictionaries","Context","Natural language processing","Conferences","Educational institutions","Computational linguistics"
Conference_Titel :
Intelligent Engineering Systems (INES), 2012 IEEE 16th International Conference on
Print_ISBN :
978-1-4673-2694-0
DOI :
10.1109/INES.2012.6249891