DocumentCode
2735985
Title
A new search method for ranking short text messages using semantic features and cluster coherence
Author
Trifan, Mircea ; Ionescu, Dan
Author_Institution
SITE, Univ. of Ottawa, Ottawa, ON, Canada
fYear
2010
fDate
27-29 May 2010
Firstpage
643
Lastpage
648
Abstract
A search results ranking method that uses semantic features and a cluster coherence measure is introduced in this paper. The quality of the returned search results is improved by grouping semantically related texts into clusters displayed in descending cluster size order. First the term-document matrix is constructed where the documents correspond to individual texts. Then, nonnegative matrix factorization (NMF) is used to group the texts into semantically related clusters. Only those clusters whose coherence is greater than a threshold value are displayed. In this way trending conceptually similar texts that re-occur in the input of multiple users are identified. The advantage of this approach compared to other methods [6] consists in the fact that the clusters in the approach introduced in this paper are computed by semantic similarity and not only by texts counters.
Keywords
Clustering algorithms; Counting circuits; Data mining; Fabrics; Navigation; Noise figure; Search engines; Search methods; Social network services; Twitter;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Cybernetics and Technical Informatics (ICCC-CONTI), 2010 International Joint Conference on
Conference_Location
Timisoara, Romania
Print_ISBN
978-1-4244-7432-5
Type
conf
DOI
10.1109/ICCCYB.2010.5491333
Filename
5491333
Link To Document