DocumentCode
674936
Title
Browser with Clustering of Web Documents
Author
Tetali, Ravitheja ; Bose, Joy ; Arif, Tasleem
Author_Institution
WMG Group, Samsung Res. Inst. India Bangalore, Bangalore, India
fYear
2013
fDate
15-17 Dec. 2013
Firstpage
164
Lastpage
168
Abstract
Accessing relevant information quickly, given limited time and space, is a major issue in Web browsers, especially those in mobile devices. In this paper we propose a framework for grouping similar Web documents in a browser based on similar content of the browsed pages. This grouping can help reduce clutter and enable the user to access relevant Web information quickly. The algorithm we used for clustering is MajorClust, a document similarity algorithm based on tokenizing the words in the document and then determining a cosine similarity measure to estimate the distance between the words. The entire algorithm for clustering is implemented inside the browser without the need of an external Web server. We have implemented and tested the algorithm on a mobile browser and obtained accurate finer clustering of Web pages when compared to Alexa´s sub-categories.
Keywords
Internet; Web sites; mobile computing; online front-ends; pattern clustering; text analysis; MajorClust; Web browsers; Web documents clustering; Web information; Web pages clustering; cosine similarity measure; document similarity algorithm; document words tokenizing; mobile browser; Browsers; Clustering algorithms; Engines; History; Mobile communication; Mobile handsets; Web pages; MajorClust; Web browser; Web history; document clustering; intelligent browsing;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Computing, Networking and Security (ADCONS), 2013 2nd International Conference on
Conference_Location
Mangalore
Type
conf
DOI
10.1109/ADCONS.2013.20
Filename
6714157
Link To Document