• DocumentCode
    3122220
  • Title
    Distributed Structural Relaxation of XPath Queries
  • Author
    Koloniari, Georgia ; Pitoura, Evaggelia
  • Author_Institution
    Comput. Sci. Dept., Univ. of Ioannina, Ioannina
  • fYear
    2009
  • fDate
    March 29 2009-April 2 2009
  • Firstpage
    529
  • Lastpage
    540
  • Abstract
    Due to the structural heterogeneity of XML, queries are often interpreted approximately. This is achieved by relaxing the query and ranking the results based on their relevance to the original query. Query relaxation over distributed XML repositories may incur large communication costs, since partial result lists from different sites need to be gathered and ranked to assemble the overall top-k results. To process such queries efficiently, we propose using a distributed clustered index to group documents based on their structural similarity. The clustered index proves to be very effective in reducing the sizes of the partial lists that need to be combined. Furthermore, it can be used as the basis of a pay-as-you-go approach, where clusters of documents are accessed gradually, providing the user with increasingly improving results. To reduce the cost of constructing and maintaining the clustered index, we use a compact data structure that trades off accuracy for storage and communication efficiency. The index is also used for selectivity estimation so that query relaxation is geared towards the most promising structural transformations. Our experimental results show that our approach significantly reduces the communication cost for retrieving the top-k results, while maintaining a low construction cost for the clustered index.
  • Keywords
    XML; distributed databases; query processing; XPath queries; compact data structure; distributed XML repositories; distributed clustered index; distributed structural relaxation; pay-as-you-go approach; structural heterogeneity; Assembly; Computer science; Costs; Data engineering; Data structures; Distributed processing; Internet; Query processing; Scalability; XML;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    1084-4627
  • Print_ISBN
    978-1-4244-3422-0
  • Electronic_ISBN
    1084-4627
  • Type
    conf
  • DOI
    10.1109/ICDE.2009.110
  • Filename
    4812432