Title :
Visual and textual summarization of webpages
Author :
Akhtar, Naheed ; Siddique, Bushra ; Afroz, Rounaque
Author_Institution :
Dept. of Comput. Eng., Aligarh Muslim Univ., Aligarh, India
Abstract :
Searching for and re-finding pages are among the most common tasks on the Internet. To make these tasks more efficient, we propose a scheme for visually summarizing web pages. The summary lets users quickly grasp what a web page is about and helps them recall pages they have already visited. For this purpose, we employ latent semantic analysis to prepare a gist of the textual content, accompanied by the page title and a relevant image. The image may be internal or external, depending on the contents of the web page. If the web page contains dominant internal images, a ranking algorithm selects the most dominant one for the summary. If not, key phrases are extracted from the text and used to search the Internet for a relevant external image. Once the image search results are obtained, a re-ranking algorithm is applied to select the most appropriate image.
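The latent semantic analysis step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the details, so the function name `lsa_summary`, the regex-based sentence splitter and tokenizer, the raw term counts, and the sentence score (the length of each sentence's vector in the top-k singular space) are all assumptions chosen for illustration.

```python
import re
import numpy as np


def lsa_summary(text, n_sentences=2, n_topics=2):
    """Pick n_sentences from text using a simple LSA-style score.

    Hypothetical helper for illustration; not the method from the paper.
    """
    # Naive sentence split on ., ! or ? followed by whitespace (assumption).
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    if len(sentences) <= n_sentences:
        return sentences

    # Lowercased alphabetic tokens; no stop-word removal or stemming.
    tokenized = [re.findall(r"[a-z]+", s.lower()) for s in sentences]
    vocab = sorted({w for toks in tokenized for w in toks})
    if not vocab:
        return sentences[:n_sentences]
    index = {w: i for i, w in enumerate(vocab)}

    # Term-by-sentence matrix of raw counts.
    A = np.zeros((len(vocab), len(sentences)))
    for j, toks in enumerate(tokenized):
        for w in toks:
            A[index[w], j] += 1.0

    # Singular value decomposition: columns of Vt correspond to sentences.
    _, S, Vt = np.linalg.svd(A, full_matrices=False)
    k = min(n_topics, len(S))

    # Score each sentence by its weighted length in the top-k latent space.
    scores = np.sqrt(((S[:k, None] * Vt[:k, :]) ** 2).sum(axis=0))

    # Keep the best-scoring sentences, restored to document order.
    top = sorted(int(i) for i in np.argsort(-scores)[:n_sentences])
    return [sentences[i] for i in top]
```

Calling `lsa_summary(page_text, n_sentences=3)` on the visible text of a page would return three sentences in their original order. A fuller pipeline would likely add stop-word removal and term weighting before the decomposition.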
Keywords :
Internet; image retrieval; text analysis; Web pages; dominant internal image; external images; key phrase extraction; latent semantic analysis; ranking algorithm; reranking algorithm; textual contents; textual summarization; visual summarization; Feature extraction; Search engines; Semantics; Vectors; Visualization; Web search; text summarization
Conference_Title :
Data Mining and Intelligent Computing (ICDMIC), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-4675-4
DOI :
10.1109/ICDMIC.2014.6954267