DocumentCode
2322136
Title
Can Collective Use Help for Searching?
Author
Dicheva, Darina ; Dichev, Christo
Author_Institution
Comput. Sci. Dept., Winston-Salem State Univ., Winston Salem, NC, USA
fYear
2011
fDate
10-12 Oct. 2011
Firstpage
24
Lastpage
31
Abstract
In this paper we propose a "find similar" method intended to extend the searching capabilities of digital collections targeting educational and academic domains. Given a document, the described algorithm finds similar documents that may be of interest to the user. It exploits the metadata typical for the participatory web. In the adopted model, documents are viewed as objects associated with a set of tags and a set of users who have tagged them, inducing tag-based and user-based similarity. The similarity between two documents is computed as a combination of their tag-base and, user-based cosine similarity and the document recency. We have con-ducted a series of experiments using a CiteULike dump to investigate the properties of the proposed similarity measure. The experimental results indicate that the algorithm exploiting meta-information about the documents provides a good approximation of our understanding of the contextual dependency of the notion of similarity.
Keywords
Internet; document handling; identification technology; meta data; pattern matching; query formulation; CiteULike dump; Web participatory; academic domain; contextual dependency; digital collection; document recency; educational domain; metadata; searching capability; tag-based similarity; user-based cosine similarity; Accuracy; Bipartite graph; Collaboration; Complex networks; Humans; Tagging; Vectors; finding similar documents; folksonomy; information retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2011 International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4577-1827-4
Type
conf
DOI
10.1109/CyberC.2011.14
Filename
6079398
Link To Document