Title :
Text Document Clustering Based on the Modifying Relations
Author :
Weixin, Tian ; Fuxi, Zhu
Author_Institution :
Comput. Sch., Wuhan Univ., Wuhan
Abstract :
Text document clustering plays an important role in the modern knowledge management. This paper addresses the task of developing an effective and efficient method of clustering the text document. To meet this requirement, we first extract the modifying relations (MR) from the sentences and then organize them as feature set for representing the document. A novel similarity measure is proposed on the basis of MR-vectors in this paper. We use agglomerative hierarchical clustering algorithm in the experimental work and compare the results with other previous studies.
Keywords :
data structures; document handling; pattern clustering; text analysis; agglomerative hierarchical clustering algorithm; document representation; knowledge management; modifying relations; text document clustering; Algorithm design and analysis; Clustering algorithms; Clustering methods; Computer science; Data mining; Educational institutions; Information technology; Knowledge management; Software engineering; Text mining; Text mining; clustering; document representaion; modifying relation;
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
DOI :
10.1109/CSSE.2008.1545