• DocumentCode
    140898
  • Title

    Breaking out of the MisMatch trap

  • Author

    Yong Zeng ; Zhifeng Bao ; Tok Wang Ling ; Jagadish, H.V. ; Guoliang Li

  • Author_Institution
    Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    940
  • Lastpage
    951
  • Abstract
    When users issue a query to a database, they have expectations about the results. If what they search for is unavailable in the database, the system will return an empty result or, worse, erroneous mismatch results.We call this problem the MisMatch Problem. In this paper, we solve the MisMatch problem in the context of XML keyword search. Our solution is based on two novel concepts that we introduce: Target Node Type and Distinguishability. Using these concepts, we develop a low-cost post-processing algorithm on the results of query evaluation to detect the MisMatch problem and generate helpful suggestions to users. Our approach has three noteworthy features: (1) for queries with the MisMatch problem, it generates the explanation, suggested queries and their sample results as the output to users, helping users judge whether the MisMatch problem is solved without reading all query results; (2) it is portable as it can work with any LCA-based matching semantics and is orthogonal to the choice of result retrieval method adopted; (3) it is lightweight in the way that it occupies a very small proportion of the whole query evaluation time. Extensive experiments on three real datasets verify the effectiveness, efficiency and scalability of our approach. A search engine called XClear has been built and is available at http://xclear.comp.nus.edu.sg.
  • Keywords
    XML; query processing; search engines; LCA-based matching semantics; XClear; XML keyword search; database querying; distinguishability; erroneous mismatch results; low-cost post-processing algorithm; mismatch problem; mismatch trap; query evaluation time; result retrieval method; search engine; target node type; Context; Image color analysis; Keyword search; Manganese; Portable computers; Semantics; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816713
  • Filename
    6816713