• DocumentCode
    2735985
  • Title

    A new search method for ranking short text messages using semantic features and cluster coherence

  • Author

    Trifan, Mircea ; Ionescu, Dan

  • Author_Institution
    SITE, Univ. of Ottawa, Ottawa, ON, Canada
  • fYear
    2010
  • fDate
    27-29 May 2010
  • Firstpage
    643
  • Lastpage
    648
  • Abstract
    A search results ranking method that uses semantic features and a cluster coherence measure is introduced in this paper. The quality of the returned search results is improved by grouping semantically related texts into clusters displayed in descending cluster size order. First the term-document matrix is constructed where the documents correspond to individual texts. Then, nonnegative matrix factorization (NMF) is used to group the texts into semantically related clusters. Only those clusters whose coherence is greater than a threshold value are displayed. In this way trending conceptually similar texts that re-occur in the input of multiple users are identified. The advantage of this approach compared to other methods [6] consists in the fact that the clusters in the approach introduced in this paper are computed by semantic similarity and not only by texts counters.
  • Keywords
    Clustering algorithms; Counting circuits; Data mining; Fabrics; Navigation; Noise figure; Search engines; Search methods; Social network services; Twitter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Cybernetics and Technical Informatics (ICCC-CONTI), 2010 International Joint Conference on
  • Conference_Location
    Timisoara, Romania
  • Print_ISBN
    978-1-4244-7432-5
  • Type

    conf

  • DOI
    10.1109/ICCCYB.2010.5491333
  • Filename
    5491333