• DocumentCode
    3263943
  • Title

    Dependence among terms in vector space model

  • Author

    Silva, Ilmério R. ; Souza, João Nunes ; Santos, Karina S.

  • Author_Institution
    Univ. Fed. de Uberlandia, Brazil
  • fYear
    2004
  • fDate
    7-9 July 2004
  • Firstpage
    97
  • Lastpage
    102
  • Abstract
    The vector space model is a mathematical-based model that represents terms, documents and queries by vectors and provides a ranking. In this model, the subspace of interest is formed by a set of pairwise orthogonal term vectors, indicating that terms are mutually independent. However, this is a simplification that doesn´t correspond to the reality. Based on this scenery, we present, in this work, an extension to the vector space model to take into account the correlation between terms. In the proposed model, term vectors are rotated in space geometrically reflecting the dependence semantics among terms. We rotate terms based on a data mining technique called association rules. The retrieval effectiveness of the proposed model is evaluated and the results shows that our model improves in average precision, relative to the standard vector space model, for all collections evaluated, leading to a gain up to 31%.
  • Keywords
    data mining; information retrieval; association rules; data mining; information retrieval; mathematical-based model; pairwise orthogonal term vectors; vector space model; Association rules; Data engineering; Data mining; Databases; Information retrieval; Mathematical model; Proposals; Set theory; Solid modeling; Thesauri;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Engineering and Applications Symposium, 2004. IDEAS '04. Proceedings. International
  • ISSN
    1098-8068
  • Print_ISBN
    0-7695-2168-1
  • Type

    conf

  • DOI
    10.1109/IDEAS.2004.1319782
  • Filename
    1319782