DocumentCode
604481
Title
Mapping texts into graphs: An improved text similarity algorithm
Author
Zuoguo Liu ; Xiaorong Chen
Author_Institution
Coll. of Comput. Sci. & Inf., Guizhou Univ., Guiyang, China
fYear
2012
fDate
29-31 Dec. 2012
Firstpage
1357
Lastpage
1361
Abstract
An improved graph-based text similarity (GBTS) algorithm which based on original GBTS is illustrated in this paper. A text is mapped into a graph which consists of terms as its nodes and term sequences as its undirected edges. The Maximum Common Subgraph (MCS) of two graphs is useful for analyzing their similarity. What´s more, the similarity of two texts is divided into two parts: nodes similarity and edges similarity. Each part is calculated respectively and text similarity is the sum of two parts. At last, the improved algorithm is compared with the original one.
Keywords
graph theory; text analysis; GBTS algorithm; MCS; edge similarity; graph-based text similarity algorithm; maximum common subgraph; node similarity; term sequences; text mapping; undirected edges; graph theory; mapped graph; maximum common subgraph;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Network Technology (ICCSNT), 2012 2nd International Conference on
Conference_Location
Changchun
Print_ISBN
978-1-4673-2963-7
Type
conf
DOI
10.1109/ICCSNT.2012.6526173
Filename
6526173
Link To Document