DocumentCode :
529363
Title :
Visualizing similar text documents based on 3D dendrogram
Author :
Kinoshita, Tomohito ; Ohkubo, Tomoyuki ; Kobayashil, Kazuyuki ; Watanabe, Kajiro ; Kurihara, Yosuke
Author_Institution :
Fac. of Eng., Hosei Univ., Tokyo, Japan
fYear :
2010
fDate :
18-21 Aug. 2010
Firstpage :
1285
Lastpage :
1288
Abstract :
This paper describes a new data visualization method using a three-dimensional dendrogram to And the relationship between similar text documents. For the detection of similarities in documents, we introduce two basic evaluation functions, "LCS" and "SED." We constructed and developed the proposed algorithm in MATLAB language, and conducted a comparison measurement and examined the evaluations. The validity of the proposed method can be verified by applying actual documents to demonstrate the 3D visualization.
Keywords :
data visualisation; mathematics computing; text analysis; LCS; MATLAB language; SED; data visualization method; longest common subsequence; shortest edit distance; similar text document visuallization; three-dimensional dendrogram; Algorithm design and analysis; Clustering algorithms; Data visualization; Detection algorithms; Software algorithms; Three dimensional displays; Visualization; 3D visualization; LCS; SED; dendrogram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SICE Annual Conference 2010, Proceedings of
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-7642-8
Type :
conf
Filename :
5602599
Link To Document :
بازگشت