• DocumentCode
    511074
  • Title

    Exploring HTML Tags and Metadata to Improve the Expressiveness of Web Search Engine's Queries

  • Author

    Escudeiro, Nuno Filipe ; Escudeiro, Paula Maria

  • Author_Institution
    Lab. de Intel. Artificial e Apoio a Decisao, Inst. Politec. do Porto, Porto, Portugal
  • Volume
    1
  • fYear
    2009
  • fDate
    28-30 Dec. 2009
  • Firstpage
    569
  • Lastpage
    573
  • Abstract
    Web search engines are powerful tools used to satisfy specific information needs on the Web. Their purpose is to maximize user satisfaction when performing this task. Although there are other sources of evidence, besides text, to characterize document relevance for a specific need, especially for HTML documents, current search engines do not allow users to explore these features when posing a query. Search engine queries are based almost exclusively on keywords. We believe that it is possible to improve user satisfaction if HTML tags and document metadata are available to users at query time. In this paper we present Xearch, a meta-search system that wraps public search engines in a framework that improves both the expressiveness of the language available for the user to specify information needs and the control over the answer format. Xearch converts HTML pages to a specific XML schema, covering text and metadata derived from HTML. User queries are then submitted on this schema and can be specified through keywords but also explore documents' HTML tags and metadata. Results from our experimental evaluation confirm that it is possible to improve the answer quality with this framework.
  • Keywords
    Internet; XML; hypermedia markup languages; query processing; search engines; HTML documents; HTML tags; Web search engine queries; XML schema; Xearch; document metadata; meta-search system; Control systems; Database languages; HTML; Information retrieval; Metasearch; Power engineering computing; Search engines; Web pages; Web search; XML; information retrieval; query language;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Electrical Engineering, 2009. ICCEE '09. Second International Conference on
  • Conference_Location
    Dubai
  • Print_ISBN
    978-1-4244-5365-8
  • Electronic_ISBN
    978-0-7695-3925-6
  • Type

    conf

  • DOI
    10.1109/ICCEE.2009.228
  • Filename
    5380181