• Title of article

    Contextualization models for XML retrieval

  • Author/Authors

    Paavo Arvola, Jaana Kekäläinen, Marko Junkkari

  • Issue Information
    Bimonthly, serial issue, year 2011
  • Pages
    15
  • From page
    762
  • To page
    776
  • Abstract
    In a hierarchical XML structure, surrounding elements form the context of an XML element. In document-oriented XML, the context is a part of the semantics of the element and augments its textual information. The process of taking the context of the element into account in element scoring is called contextualization. This study extends the concept of contextualization and presents a classification of contextualization models. In an XML collection, elements are of different granularity, i.e. lower level elements are shorter and carry less textual information. Thus, it seems credible that contextualization interacts differently with diverse elements. Even if it is known that contextualization leads to improved effectiveness in element retrieval, the improvement on different granularity levels has not been investigated. This study explores the effect of contextualization on these levels. Further, a parameterized framework for testing contextualization is presented. The empirical part of the study is carried out in a traditional laboratory setting, where an XML collection is granulated. This is necessary in order to measure performance separately at different hierarchy levels. The results confirm the effectiveness of contextualization, and show how the elements of different granularities benefit from contextualization.
  • Keywords
    Semi-structured data, Structured documents, Contextualization, Evaluation, Granularity level, Granulation, XML, Content element
  • Journal title
    Information Processing and Management
  • Serial Year
    2011
  • Record number

    1229164