Title :
A new search method for ranking short text messages using semantic features and cluster coherence
Author :
Trifan, Mircea ; Ionescu, Dan
Author_Institution :
SITE, Univ. of Ottawa, Ottawa, ON, Canada
Abstract :
A search results ranking method that uses semantic features and a cluster coherence measure is introduced in this paper. The quality of the returned search results is improved by grouping semantically related texts into clusters displayed in descending cluster size order. First the term-document matrix is constructed where the documents correspond to individual texts. Then, nonnegative matrix factorization (NMF) is used to group the texts into semantically related clusters. Only those clusters whose coherence is greater than a threshold value are displayed. In this way trending conceptually similar texts that re-occur in the input of multiple users are identified. The advantage of this approach compared to other methods [6] consists in the fact that the clusters in the approach introduced in this paper are computed by semantic similarity and not only by texts counters.
Keywords :
Clustering algorithms; Counting circuits; Data mining; Fabrics; Navigation; Noise figure; Search engines; Search methods; Social network services; Twitter;
Conference_Titel :
Computational Cybernetics and Technical Informatics (ICCC-CONTI), 2010 International Joint Conference on
Conference_Location :
Timisoara, Romania
Print_ISBN :
978-1-4244-7432-5
DOI :
10.1109/ICCCYB.2010.5491333