DocumentCode
124141
Title
A New Sentence Similarity Method Based on a Three-Layer Sentence Representation
Author
Ferreira, Ricardo ; Dueire Lins, Rafael ; Freitas, Fred ; Avila, Beatriz ; Simske, Steven J. ; Riss, Marcelo
Author_Institution
Inf. Center, Fed. Univ. of Pernambuco, Recife, Brazil
Volume
1
fYear
2014
fDate
11-14 Aug. 2014
Firstpage
110
Lastpage
117
Abstract
Sentence similarity methods are used to assess the degree of likelihood between phrases. Many natural language applications such as text summarization, information retrieval, text categorization, and machine translation employ measures of sentence similarity. The existing approaches for this problem represent sentences as vectors of bag of words or the syntactic information of the words in the phrase. The likelihood between phrases is calculated by composing the similarity between the words in the sentences. Such schemes do not address two important concerns in the area, however: the semantic problem and the word order. This paper proposes a new sentence similarity measure that attempts to address such problems by taking into account the lexical, syntactic, and semantic analysis of sentences. The new similarity measure proposed outperforms the state of the art systems in around 6%, when tested using a standard and publically available dataset.
Keywords
natural language processing; text analysis; information retrieval; machine translation; natural language applications; semantic analysis; semantic problem; sentence similarity measure; sentence similarity method; syntactic information; text categorization; text summarization; three-layer sentence representation; word order; Companies; Insurance; Measurement; Resource description framework; Semantics; Syntactics; Vectors; Natural Language Processing; Sentence Representation; Sentence Similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location
Warsaw
Type
conf
DOI
10.1109/WI-IAT.2014.23
Filename
6927532
Link To Document