• DocumentCode
    2709386
  • Title

    Visual Data Alignment for Search Engine Results Pages

  • Author

    Hong, Jer Lang ; Siew, Eu-Gene ; Egerton, Simon

  • fYear
    2010
  • fDate
    7-10 May 2010
  • Firstpage
    141
  • Lastpage
    145
  • Abstract
    Visual wrappers use visual information in addition to the DOM Tree properties in the extraction of data records. However, a closer look indicates that visual information can also be used to align data records into tabular form. In this paper, we propose a data alignment algorithm to align data records using DOM Tree properties and visual cue of data records. Our data alignment algorithm uses a regular expression rule and incorporates visual cue such as relative position and size of data items to provide options for the alignment of iterative and disjunctive data items. Results show that our wrapper performs better than existing state of the art wrappers.
  • Keywords
    data visualisation; records management; search engines; tree data structures; DOM tree properties; data alignment algorithm; disjunctive data items; iterative data items; regular expression rule; search engine results pages; visual cue; visual data alignment; visual wrappers; Cats; Data mining; HTML; Information retrieval; Iterative algorithms; Positron emission tomography; Robustness; Search engines; Tree data structures; Visual perception; Data Alignment; Visual Information; Wrapper;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Research and Development, 2010 Second International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-0-7695-4043-6
  • Type

    conf

  • DOI
    10.1109/ICCRD.2010.78
  • Filename
    5489484