DocumentCode :
2735985
Title :
A new search method for ranking short text messages using semantic features and cluster coherence
Author :
Trifan, Mircea ; Ionescu, Dan
Author_Institution :
SITE, Univ. of Ottawa, Ottawa, ON, Canada
fYear :
2010
fDate :
27-29 May 2010
Firstpage :
643
Lastpage :
648
Abstract :
A search results ranking method that uses semantic features and a cluster coherence measure is introduced in this paper. The quality of the returned search results is improved by grouping semantically related texts into clusters displayed in descending cluster size order. First the term-document matrix is constructed where the documents correspond to individual texts. Then, nonnegative matrix factorization (NMF) is used to group the texts into semantically related clusters. Only those clusters whose coherence is greater than a threshold value are displayed. In this way trending conceptually similar texts that re-occur in the input of multiple users are identified. The advantage of this approach compared to other methods [6] consists in the fact that the clusters in the approach introduced in this paper are computed by semantic similarity and not only by texts counters.
Keywords :
Clustering algorithms; Counting circuits; Data mining; Fabrics; Navigation; Noise figure; Search engines; Search methods; Social network services; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Cybernetics and Technical Informatics (ICCC-CONTI), 2010 International Joint Conference on
Conference_Location :
Timisoara, Romania
Print_ISBN :
978-1-4244-7432-5
Type :
conf
DOI :
10.1109/ICCCYB.2010.5491333
Filename :
5491333
Link To Document :
بازگشت