• DocumentCode
    2330652
  • Title

    On text-based estimation of document relevance

  • Author

    Savia, Eerika ; Kaski, Samuel ; Tuulos, Ville ; Myllymäki, Petri

  • Author_Institution
    Neural Networks Res. Centre, Helsinki Univ. of Technol., Finland
  • Volume
    4
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    3275
  • Abstract
    This work is part of a proactive information retrieval project that aims at estimating relevance from implicit user feedback. The noisy feedback signal needs to be complemented with all available information, and textual content is one of the natural sources. Here we take the first steps by investigating whether this source is at all useful in the challenging setting of estimating the relevance of a new document based on only few samples with known relevance. It turns out that even sophisticated unsupervised methods like multinomial PCA (or latent Dirichlet allocation) cannot help much. By contrast, feature extraction supervised by relevant auxiliary data may help.
  • Keywords
    document handling; feature extraction; feedback; information retrieval; document relevance estimation; feature extraction; implicit user feedback; noisy feedback signal; proactive information retrieval project; Computer networks; Computer science; Content based retrieval; Electronic mail; Feature extraction; Feedback; Information retrieval; Information technology; Principal component analysis; Usability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • Conference_Location
    Budapest
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1381204
  • Filename
    1381204