DocumentCode :
1880232
Title :
A highly efficient distributed indexing system based on large cluster of commodity machines
Author :
Pole, Govind S. ; Potey, Madhuri A.
Author_Institution :
Dept. of Comput. Eng., D.Y. Patil Coll. of Eng., Pune, India
fYear :
2012
fDate :
20-22 Sept. 2012
Firstpage :
1
Lastpage :
4
Abstract :
An Information Retrieval System using centralized approach demands long time to update the web index. A highly efficient distributed indexing system operates on large & diverse datasets with optimum time consumption compared to centralized approach to update web index. In this paper, a prototype model of highly efficient distributed indexing system deployed to run on cluster of commodity machines for the creation of large index using functionality of Apache Lucene. Experimental results showed efficiency of distributed indexing process. This distributed approach helps to reduce time interval for index creation and updation, in turn keeps the index content more fresh.
Keywords :
Internet; indexing; information retrieval systems; Apache Lucene functionality; Web index update; centralized approach; commodity machine cluster; highly efficient distributed indexing system; index content; information retrieval system; large index creation; Computers; Educational institutions; Indexing; Search engines; Standards; Web pages; commodity computing; dataset; distributed indexers; lucene; parser; retrieval;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Wireless and Optical Communications Networks (WOCN), 2012 Ninth International Conference on
Conference_Location :
Indore
ISSN :
2151-7681
Print_ISBN :
978-1-4673-1988-1
Type :
conf
DOI :
10.1109/WOCN.2012.6335562
Filename :
6335562
Link To Document :
بازگشت