• DocumentCode
    140986
  • Title

    A tool for Internet-scale cardinality estimation of XPath queries over distributed semistructured data

  • Author

    Slavov, Vasil ; Katib, Anas ; Rao, Prahlada

  • Author_Institution
    Univ. of Missouri-Kansas City, Kansas City, MO, USA
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    1270
  • Lastpage
    1273
  • Abstract
    We present a novel tool called XGossip for Internet-scale cardinality estimation of XPath queries over distributed XML data. XGossip relies on the principle of gossip, is scalable, decentralized, and can cope with network churn and failures. It employs a novel divide-and-conquer strategy for load balancing and reducing the overall network bandwidth consumption. It has a strong theoretical underpinning and provides provable guarantees on the accuracy of cardinality estimates, the number of messages exchanged, and the total bandwidth usage. In this demonstration, users will experience three engaging scenarios: In the first scenario, they can set up, configure, and deploy XGossip on Amazon Elastic Compute Cloud (EC2). In the second scenario, they can execute XGossip, pose XPath queries, observe in real-time the convergence speed of XGossip, the accuracy of cardinality estimates, the bandwidth usage, and the number of messages exchanged. In the third scenario, they can introduce network churn and failures during the execution of XGossip and observe how these impact the behavior of XGossip.
  • Keywords
    Internet; XML; divide and conquer methods; query processing; resource allocation; Amazon Elastic Compute Cloud; EC2; Internet-scale cardinality estimation; XGossip tool; XPath queries; bandwidth usage; distributed XML data; distributed semistructured data; divide-and-conquer strategy; extensible markup language; load balancing; message exchange; network bandwidth consumption; network churn; Accuracy; Bandwidth; Convergence; Engines; Estimation; Medical services; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816758
  • Filename
    6816758