• Title of article

    Document–document similarity approaches and science mapping: Experimental comparison of five approaches

  • Author/Authors

    Ahlgren، نويسنده , , Per and Colliander، نويسنده , , Cristian، نويسنده ,

  • Issue Information
    فصلنامه با شماره پیاپی سال 2009
  • Pages
    15
  • From page
    49
  • To page
    63
  • Abstract
    This paper treats document–document similarity approaches in the context of science mapping. Five approaches, involving nine methods, are compared experimentally. We compare text-based approaches, the citation-based bibliographic coupling approach, and approaches that combine text-based approaches and bibliographic coupling. Forty-three articles, published in the journal Information Retrieval, are used as test documents. We investigate how well the approaches agree with a ground truth subject classification of the test documents, when the complete linkage method is used, and under two types of similarities, first-order and second-order. The results show that it is possible to achieve a very good approximation of the classification by means of automatic grouping of articles. One text-only method and one combination method, under second-order similarities in both cases, give rise to cluster solutions that to a large extent agree with the classification.
  • Keywords
    Citation data , Textual data , Cluster analysis , Data source combination , Science mapping
  • Journal title
    Journal of Informetrics
  • Serial Year
    2009
  • Journal title
    Journal of Informetrics
  • Record number

    1387093