• DocumentCode
    2929464
  • Title

    Content-only querying structured contexts using formal concept analysis

  • Author

    Zerarga, Loutfi ; Djouadi, Yassine

  • Author_Institution
    Fac. des Sci., Univ. M´hamed Bouguara de Boumerdes, Boumerdès, Algeria
  • fYear
    2013
  • fDate
    22-24 April 2013
  • Firstpage
    58
  • Lastpage
    64
  • Abstract
    With the large increase of collections of structured documents (e.g. XML, HTML, ...), the need to retrieve different granules (fragments, sub-units, ...) of such documents, instead of the whole structure, becomes obvious. Nowadays, all existing Formal Concept Analysis-based Information Retrieval approaches address exclusively unstructured documents. They rely on the use of dyadic formal contexts (i.e. binary Documents × Terms relations). In this paper an original approach which consists of enlarging FCA-based IR paradigm to structured documents is proposed. Our approach stands from the idea of modeling structured documents by means of triadic formal contexts (i.e. ternary Documents × Terms × Structure relation). This allows to retrieve sub-units or fragments of structured documents. In structured information retrieval, queries may be of different types. This paper deals with content-only queries and gives a theoretical framework for both conjunctive as well as disjunctive forms.
  • Keywords
    document handling; formal concept analysis; query processing; FCA-based IR paradigm; content-only queries; content-only querying structured context; formal concept analysis-based information retrieval; structure relation; structured document collections; structured document modeling; structured information retrieval; term relations; ternary documents; triadic formal contexts; Erbium; Iron;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Programming and Systems (ISPS), 2013 11th International Symposium on
  • Conference_Location
    Algiers
  • Print_ISBN
    978-1-4799-1152-3
  • Type

    conf

  • DOI
    10.1109/ISPS.2013.6581495
  • Filename
    6581495