DocumentCode :
47738
Title :
Similarity Preserving Snippet-Based Visualization of Web Search Results
Author :
Gomez-Nieto, Erick ; San Roman, Frizzi ; Pagliosa, Paulo ; Casaca, Wallace ; Helou, Elias S. ; de Oliveira, Maria Cristina F. ; Nonato, Luis Gustavo
Author_Institution :
Inst. de Cienc. Mat. e de Comput. (ICMC), Univ. de Sao Paulo, Sao Carlos, Brazil
Volume :
20
Issue :
3
fYear :
2014
fDate :
Mar-14
Firstpage :
457
Lastpage :
470
Abstract :
Internet users are very familiar with the results of a search query displayed as a ranked list of snippets. Each textual snippet shows a content summary of the referred document (or webpage) and a link to it. This display has many advantages: for example, it affords easy navigation and is straightforward to interpret. Nonetheless, any user of search engines could probably report some disappointing experience with this metaphor. Indeed, it has limitations in particular situations, as it fails to provide an overview of the document collection retrieved. Moreover, depending on the nature of the query (for example, it may be too general, ambiguous, or ill expressed), the desired information may be poorly ranked, or the results may cover varied topics. Several search tasks would be easier if users were shown an overview of the returned documents, organized so as to reflect how related they are, content-wise. We propose a visualization technique to display the results of web queries aimed at overcoming such limitations. It combines the neighborhood preservation capability of multidimensional projections with the familiar snippet-based representation by employing a multidimensional projection to derive two-dimensional layouts of the query search results that preserve text similarity relations, or neighborhoods. Similarity is computed by applying the cosine similarity over a "bag-of-words" vector representation of the collection built from the snippets. If the snippets are displayed directly according to the derived layout, they will overlap considerably, producing a poor visualization. We overcome this problem by defining an energy functional that considers both the overlapping among snippets and the preservation of the neighborhood structure as given in the projected layout. Minimizing this energy functional provides a neighborhood preserving two-dimensional arrangement of the textual snippets with minimum overlap. The resulting visualization conveys both a global view of the query results and visual groupings that reflect related results, as illustrated in several examples shown.
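The similarity step named in the abstract (cosine similarity over bag-of-words vectors built from the snippets) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the tokenizer, the use of raw term counts without any weighting, the sample snippets, and the function names `bag_of_words` and `cosine_similarity` are all hypothetical choices made here.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Count word occurrences in a snippet (assumed tokenizer: lowercase alphanumerics)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty snippet is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Made-up snippets for an ambiguous query, used purely for illustration.
snippets = [
    "Jaguar cars and luxury vehicles",
    "Jaguar luxury cars dealer",
    "The jaguar is a large cat of the Americas",
]

vectors = [bag_of_words(s) for s in snippets]

# Pairwise similarity matrix; a projection method would take this
# (or the corresponding distances) as its input.
similarity = [[cosine_similarity(u, v) for v in vectors] for u in vectors]
```

In this sketch the two car-related snippets score higher with each other than either does with the animal snippet, which is the kind of neighborhood relation the two-dimensional layout is meant to preserve.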
Keywords :
Internet; data visualisation; document handling; query processing; search engines; Internet; Web page; Web queries; Web search results; document collection retrieval; energy functional; query search; referred document; search engines; search query; similarity preserving snippet based visualization; text similarity; textual snippet; Layout; Navigation; Optimization; Search engines; Vectors; Visualization; Web pages; Multidimensional projection; web search visualization;
fLanguage :
English
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
Publisher :
IEEE
ISSN :
1077-2626
Type :
jour
DOI :
10.1109/TVCG.2013.242
Filename :
6629989