DocumentCode
2910296
Title
Search Results Clustering Based on a Linear Weighting Method of Similarity
Author
Zheng, Dequan ; Liu, Haibo ; Zhao, Tiejun
Author_Institution
MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin, China
fYear
2011
fDate
15-17 Nov. 2011
Firstpage
123
Lastpage
126
Abstract
The cluster of search results can facilitate users in finding the needed from massive information. But the effect of the traditional text clustering has been verified not good enough. Lingo Algorithm, which adopts LSI for clustering, generates candidate labels first, then distributes the documents, and forms the clusters finally. On the basis of Lingo Algorithm, this paper presents a linear weighted method of Single-Pass improvement, which integrates HowNet semantic similarity and cosine similarity, fuses and rediscovers clusters, and extracting the cluster labels. The experiments have showed that our method it achieves a good results in clusters in the form of purity and F-measure.
Keywords
information needs; pattern clustering; search problems; text analysis; F-measure; LSI; Lingo algorithm; document handling; information need; linear weighting method; pattern clustering; semantic similarity; text clustering; Clustering algorithms; Feature extraction; Fuses; Matrix decomposition; Search engines; Semantics; Vectors; Cosine similarity; Information retrieval; Lingo algorithm; Semantic similarity; Text clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location
Penang
Print_ISBN
978-1-4577-1733-8
Type
conf
DOI
10.1109/IALP.2011.72
Filename
6121485
Link To Document