Title :
Architecture Design of Subject-Oriented Web Crawler
Author :
Cao Xin ; Zhang Yong ; Zhang Fuyan ; Ni Changbao
Author_Institution :
Comput. Sci. & Tech. Dept., Dalian Neusoft Univ. of Inf., Dalian, China
Abstract :
In response to the defects of traditional search engines of which it will return a large amount of information when users search keywords, and it´s hard to get the useful information, thus propose the thought of subject division, that is the subject-oriented search engine.Subject crawler is the key and unique components of the subject-based search engine. The structure of the crawler has a significant impact on the speed of Web resources, as well as the multi-machine distributed extended functionality. This paper studies and designed an architecture of subject crawler, which has flexible modular scalability and multi-machine distributed scalability, and elaborated it.
Keywords :
Internet; architectural CAD; information retrieval; search engines; Web resources; architecture design; flexible modular scalability; keyword search; multimachine distributed extended functionality; multimachine distributed scalability; subject-oriented Web crawler; subject-oriented search engines; Computer architecture; Crawlers; HTML; Instruction sets; Memory; Search engines; Sockets; Astronomical Image; Image Segmentation; Mutual Information; PCNN;
Conference_Titel :
Intelligent Systems Design and Engineering Applications, 2013 Fourth International Conference on
Conference_Location :
Zhangjiajie
Print_ISBN :
978-1-4799-2791-3
DOI :
10.1109/ISDEA.2013.444