Title :
Classification and Evaluation of Online Indexing Strategies
Author :
Hu, Rui ; Zhang, Xiang ; Wang, Peng
Author_Institution :
Coll. of Software Eng., Southeast Univ., Nanjing, China
Abstract :
Most search engines must cope with the dynamic nature of the web, and offering near real-time query service while the underlying document collection grows dramatically every day has become a major problem. As a result, online indexing has become one of the core research problems of information retrieval. In this paper, we first present a detailed classification of online indexing strategies, from the classic approaches to the state of the art. We then evaluate selected strategies. We introduce a new evaluation metric that characterizes dynamic performance when queries run concurrently with online indexing. The evaluation results characterize the performance differences among the strategies and indicate directions for future improvement in update and query performance.
Keywords :
Internet; document handling; indexing; information retrieval; pattern classification; search engines; Web; classification; document collection; kernel research problems; online indexing strategies; real-time query service; Educational institutions; HTML; Merging; Partitioning algorithms; Massive Text Data; Merge; Online Index; Query
Conference_Title :
Technologies and Applications of Artificial Intelligence (TAAI), 2011 International Conference on
Conference_Location :
Chung-Li
Print_ISBN :
978-1-4577-2174-8
DOI :
10.1109/TAAI.2011.48