Title :
Enhancing search engines through utilization of visually emphasized terms
Author :
Larsen, Henrik Legind
Author_Institution :
Dept. of Comput. Sci. & Eng., Aalborg Univ., Esbjerg, Denmark
Abstract :
We present an approach to weighted indexing of documents in information retrieval systems and search engines that utilizes the visual emphasis applied in the document text. The significance of a term in characterizing the topic of a document depends both on the number of occurrences of the term in the document and on the amount of visual emphasis applied to those occurrences. We argue that the document discrimination degree of a term, as measured by the inverse document frequency, should be applied as the default importance of the term in a query. The approach was evaluated on a real-world case set, showing good performance and the expected sensitivity to parameters.
Keywords :
indexing; information retrieval; search engines; vocabulary; document discrimination degree; information retrieval systems; inverse document frequency; performance; term occurrences; visually emphasized terms; weighted document indexing; Aggregates; Computer science; Frequency measurement; Web pages
Conference_Title :
Fuzzy Information Processing Society, 2002. Proceedings. NAFIPS. 2002 Annual Meeting of the North American
Print_ISBN :
0-7803-7461-4
DOI :
10.1109/NAFIPS.2002.1018117
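
The abstract gives no formulas, so the following is only a minimal sketch of the general idea it describes: each occurrence of a term contributes more to the index weight when it is visually emphasized, and the inverse document frequency serves as the default importance of a query term. The emphasis factors, the function names, and the plain weighted sum used for scoring are all assumptions made for illustration; they are not the author's actual weighting or aggregation scheme.

```python
import math
from collections import defaultdict

# Hypothetical emphasis factors; the abstract does not state any values.
EMPHASIS_FACTOR = {"plain": 1.0, "bold": 2.0, "heading": 3.0, "title": 4.0}


def emphasized_term_weights(occurrences):
    """Map a list of (term, emphasis_kind) occurrences to weights in [0, 1].

    Each occurrence adds its emphasis factor, so the weight grows with both
    the number of occurrences and the emphasis applied to them. Weights are
    normalized by the largest raw weight in the document (an assumption).
    """
    raw = defaultdict(float)
    for term, kind in occurrences:
        raw[term] += EMPHASIS_FACTOR.get(kind, 1.0)
    if not raw:
        return {}
    top = max(raw.values())
    return {term: weight / top for term, weight in raw.items()}


def inverse_document_frequency(index):
    """Compute idf(t) = log(N / df(t)) over an index of doc_id -> term weights."""
    n_docs = len(index)
    doc_freq = defaultdict(int)
    for weights in index.values():
        for term in weights:
            doc_freq[term] += 1
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


def score(query_terms, doc_weights, idf):
    """Score a document with idf as the default importance of each query term.

    A plain weighted sum is used here as a stand-in for whatever
    aggregation the paper actually applies.
    """
    return sum(idf.get(t, 0.0) * doc_weights.get(t, 0.0) for t in query_terms)


# Toy collection: each document is a list of (term, emphasis_kind) occurrences.
docs = {
    "d1": [("fuzzy", "title"), ("search", "plain"), ("search", "plain")],
    "d2": [("search", "bold"), ("engine", "plain")],
    "d3": [("fuzzy", "plain"), ("engine", "plain"), ("engine", "plain")],
}
index = {doc_id: emphasized_term_weights(occ) for doc_id, occ in docs.items()}
idf = inverse_document_frequency(index)
ranking = sorted(index, key=lambda d: score(["fuzzy"], index[d], idf), reverse=True)
```

In this toy collection, "fuzzy" occurs once in both d1 and d3, but d1 ranks first for the query because its occurrence is in the title and therefore carries a larger emphasis factor.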