DocumentCode
3230920
Title
Effective Page Segmentation Combining Pattern Analysis and Visual Separators for Browsing on Small Screens
Author
Xiang, Peifeng ; Yang, Xin ; Shi, Yuanchun
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing
fYear
2006
fDate
18-22 Dec. 2006
Firstpage
831
Lastpage
840
Abstract
Page segmentation plays a key role in browsing on small screens. It breaks a large page into smaller segments according to their semantic relationships. Then, various approaches such as single column adaptation and thumbnail view with zooming links can be implemented based on these page segments. However, for current flexible Web pages, segmentation remains a challenging task. This paper proposes an effective automatic segmentation method which combining pattern analysis and visual separators. The basic idea is that a page´s semantic structure is largely reflected by repeated continuous patterns and visual separators, which coincides with human´s visual perception. The proposed method works in three steps: generating a refined tag tree from the DOM tree, recognizing and merging inexact patterns recursively, and segmenting the others by visual separators. Our experimental results show that the proposed method outperforms existing methods, especially for pages automatically generated from templates
Keywords
Internet; distributed object management; image segmentation; pattern recognition; Web page; automatic segmentation method; column adaptation; page segmentation; pattern analysis; visual perception; visual separation; Computer science; HTML; Intelligent structures; Merging; Particle separators; Pattern analysis; Pattern recognition; Tree data structures; Visual perception; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on
Conference_Location
Hong Kong
Print_ISBN
0-7695-2747-7
Type
conf
DOI
10.1109/WI.2006.67
Filename
4061481
Link To Document