• DocumentCode
    2322136
  • Title

    Can Collective Use Help for Searching?

  • Author

    Dicheva, Darina ; Dichev, Christo

  • Author_Institution
    Comput. Sci. Dept., Winston-Salem State Univ., Winston Salem, NC, USA
  • fYear
    2011
  • fDate
    10-12 Oct. 2011
  • Firstpage
    24
  • Lastpage
    31
  • Abstract
    In this paper we propose a "find similar" method intended to extend the searching capabilities of digital collections targeting educational and academic domains. Given a document, the described algorithm finds similar documents that may be of interest to the user. It exploits the metadata typical for the participatory web. In the adopted model, documents are viewed as objects associated with a set of tags and a set of users who have tagged them, inducing tag-based and user-based similarity. The similarity between two documents is computed as a combination of their tag-base and, user-based cosine similarity and the document recency. We have con-ducted a series of experiments using a CiteULike dump to investigate the properties of the proposed similarity measure. The experimental results indicate that the algorithm exploiting meta-information about the documents provides a good approximation of our understanding of the contextual dependency of the notion of similarity.
  • Keywords
    Internet; document handling; identification technology; meta data; pattern matching; query formulation; CiteULike dump; Web participatory; academic domain; contextual dependency; digital collection; document recency; educational domain; metadata; searching capability; tag-based similarity; user-based cosine similarity; Accuracy; Bipartite graph; Collaboration; Complex networks; Humans; Tagging; Vectors; finding similar documents; folksonomy; information retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2011 International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4577-1827-4
  • Type

    conf

  • DOI
    10.1109/CyberC.2011.14
  • Filename
    6079398