Title :
Organizing WWW images based on the analysis of page layout and Web link structure
Author :
Cai, Deng ; He, Xiuofei ; Ma, Wei-Ying ; Wen, Ji-Rong ; HongJiang Zhang
Author_Institution :
Microsoft Res. Asia, Beijing, China
Abstract :
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for an effective and efficient method of organizing and retrieving the images available. This paper describes a method for clustering and embedding WWW images. By using a vision-based page segmentation algorithm, a Web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image. By extracting the page-to-block, block-to-image, block-to-page relationships through a link structure and page layout analysis, we construct an image graph. With the image graph model, we use techniques from spectral graph theory for image clustering and embedding. Some experimental results are given in the paper.
Keywords :
content-based retrieval; image classification; image retrieval; semantic Web; text analysis; trees (mathematics); CBIR; WWW image organization; Web link structure analysis; Web page partitioning; block-to-image relationship; content-based image retrieval; digital images; image clustering; image embedding; image graph; image semantic relationships; page layout analysis; page-to-block relationship; semantic classes; spectral graph theory; textual information extraction; vision-based page segmentation; Clustering algorithms; Data mining; Digital images; Image analysis; Image retrieval; Image segmentation; Organizing; Partitioning algorithms; Web pages; World Wide Web;
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8603-5
DOI :
10.1109/ICME.2004.1394138