DocumentCode :
1857911
Title :
Unsupervised combination of metrics for semantic class induction
Author :
Iosif, E. ; Tegos, A. ; Pangos, A. ; Fosler-Lussier, E. ; Potamianos, A.
Author_Institution :
Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania
fYear :
2006
fDate :
10-13 Dec. 2006
Firstpage :
86
Lastpage :
89
Abstract :
In this paper, unsupervised algorithms for combining semantic similarity metrics are proposed for the problem of automatic class induction. The automatic class induction algorithm is based on the work of Pargellis et al,. The semantic similarity metrics that are evaluated and combined are based on narrow- and wide-context vector- product similarity. The metrics are combined using linear weights that are computed ´on the fly´ and are updated at each iteration of the class induction algorithm, forming a corpus-independent metric. Specifically, the weight of each metric is selected to be inversely proportional to the inter-class similarity of the classes induced by that metric and for the current iteration of the algorithm. The proposed algorithms are evaluated on two corpora: a semantically heterogeneous news domain (HR-Net) and an application-specific travel reservation corpus (ATIS). It is shown, that the (unsupervised) adaptive weighting scheme outperforms the (supervised) fixed weighting scheme. Up to 50% relative error reduction is achieved by the adaptive weighting scheme.
Keywords :
text analysis; unsupervised learning; application-specific travel reservation corpus; automatic class induction; corpus-independent metric; heterogeneous news domain; semantic class induction; semantic similarity metrics; unsupervised combination; Computer science; Data mining; Induction generators; Information retrieval; Iterative algorithms; Natural languages; Ontologies; Speech; Text processing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
Type :
conf
DOI :
10.1109/SLT.2006.326823
Filename :
4123368
Link To Document :
بازگشت