Title :
Research and Implementation on Multi-Cues Based Page Segmentation Algorithm
Author :
Hu Yan ; Miao Miao
Author_Institution :
Comput. Sci. & Technol. Sch., Wuhan Univ. of Technol., Wuhan, China
Abstract :
Existing segmentation algorithms use only one kind of cue, whereas the human brain handles many cues and segments a Web page subconsciously while the user views it. This paper therefore considers several kinds of cues, simulates the process of user perception, analyzes the structure of Web pages, and proposes a multi-cues based page segmentation algorithm. After segmentation, semantic information is added to every semantic block.
Keywords :
Internet; information retrieval; Web page; human brain; multicues based page segmentation algorithm; semantic block; semantic information; user perception; Algorithm design and analysis; Brain modeling; Computer science; Content based retrieval; Data mining; HTML; Humans; Information retrieval; Internet; Web pages;
Conference_Title :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
DOI :
10.1109/CISE.2009.5363822