DocumentCode
2948971
Title
Evaluation and classification of syntax information usage in determining short text semantic similarity
Author
Batanovic, Vuk ; Bojic, Dejan
Author_Institution
Elektroteh. Fak., Univ. u Beogradu, Belgrade, Serbia
fYear
2013
fDate
26-28 Nov. 2013
Firstpage
821
Lastpage
824
Abstract
This paper outlines and categorizes ways of using syntax information in a number of algorithms for determining short text semantic similarity. Algorithm performance was evaluated using the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. Among the described algorithms and approaches to using syntax information we identify those best suited for application in languages with limited electronic linguistic tools and, with that goal in mind, we propose a new algorithm classification.
Keywords
computational linguistics; natural language processing; pattern classification; text analysis; Microsoft Research Paraphrase Corpus; algorithm performance evaluation; electronic linguistic tools; paraphrase detection test; short-text semantic similarity determination; syntax information usage classification; syntax information usage evaluation; Coal; Computational linguistics; Electronic mail; Knowledge discovery; Labeling; Semantics; Syntactics;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications Forum (TELFOR), 2013 21st
Conference_Location
Belgrade
Print_ISBN
978-1-4799-1419-7
Type
conf
DOI
10.1109/TELFOR.2013.6716356
Filename
6716356
Link To Document