• DocumentCode
    2825375
  • Title

    Research and Implementation on Multi-Cues Based Page Segmentation Algorithm

  • Author

    Hu Yan ; Miao Miao

  • Author_Institution
    Comput. Sci. & Technol. Sch., Wuhan Univ. of Technol., Wuhan, China
  • fYear
    2009
  • fDate
    11-13 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In view of that the exiting segmentation algorithms use only one kind of cue, but while users viewing the Web page, human brain can handle many cues and segment the pages subconsciously. So this paper considers many kinds of cues, simulates the process of user-perception, analyzes the structure of Web pages and proposes the multi-cues based page segmentation algorithm. After the segmentation, semantic information is added to every semantic block.
  • Keywords
    Internet; information retrieval; Web page; human brain; multicues based page segmentation algorithm; semantic block; semantic information; user perception; Algorithm design and analysis; Brain modeling; Computer science; Content based retrieval; Data mining; HTML; Humans; Information retrieval; Internet; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4507-3
  • Electronic_ISBN
    978-1-4244-4507-3
  • Type

    conf

  • DOI
    10.1109/CISE.2009.5363822
  • Filename
    5363822