Title :
Web search result optimization by mining the search engine query logs
Author :
Sharma, Arvind Kumar ; Aggarwal, Neha ; Duhan, Neelam ; Gupta, Ranjna
Author_Institution :
Dept. of Comput. Eng., YMCA Univ. of Sci. & Technol., Faridabad, India
Abstract :
Modern Information Retrieval Systems match the terms of a user query with available documents in their index and return a large number of Web pages generally in the form of a ranked list. It becomes almost impractical at the user end to examine every returned document, thus necessitating the need to look for some means of result optimization. In this paper, a novel result optimization technique based on learning from historical query logs is being proposed, which predicts users´ information needs and reduces their navigation time within the result list. The method first performs query clustering in query logs based on a novel similarity function and then captures the sequential patterns of clicked web pages in each cluster using a sequential pattern mining algorithm. Finally, search result list is re-ranked by updating the existing PageRank values of pages using the discovered sequential patterns. The proposed work results in reduced search space as user intended pages tend to move upwards in the result list.
Keywords :
Internet; data mining; optimisation; pattern clustering; query processing; search engines; search problems; PageRank; Web pages; Web search result optimization; information retrieval system; query clustering; search engine; search space; sequential pattern mining algorithm; similarity function; user query logs; Databases; PageRank Algorithm; Query Clustering; Query Logs; Sequential Pattern Mining; Web;
Conference_Titel :
Methods and Models in Computer Science (ICM2CS), 2010 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4244-9701-0
DOI :
10.1109/ICM2CS.2010.5706716