DocumentCode
3338647
Title
Internet medicine information monitoring system based on focused crawler
Author
Yan, Hong-yi ; Hao, Ping
Author_Institution
Coll. of Inf. Eng., Zhejiang Univ. of Technol., Hangzhou, China
fYear
2010
fDate
23-25 June 2010
Firstpage
452
Lastpage
456
Abstract
Aiming at the problem, which it is difficult to monitor the medicine trade information on Internet, proposed a combined strategy that searched specific topic on the Internet based on analyzing focused crawler´s searching algorithm. The combined strategy includes page-searching and relativity analysis. Page relativity algorithm adopts improved Fish-Search algorithm; Relativity analysis adopts distributed algorithm, hereinto the first step makes use of Vector space model (VSM) algorithm to find out the great topic in the rough. The second step adopts improved Native bayes classification algorithm to select the correlative small topic from the previous step´s result. On basis of researching, develops an information monitoring system facing the medicine on Internet. By testing the data of some websites and forums´ page, the result shows, the combined searching strategy improves the harvest ratio and small topic search´s efficiency of the focused crawler system.
Keywords
Algorithm design and analysis; Computer science; Computerized monitoring; Crawlers; Educational institutions; Functional analysis; Information analysis; Internet; System testing; Uniform resource locators; Distributed relativity algorithm; Fish-Search algorithm; Focused crawler;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Sciences and Interaction Sciences (ICIS), 2010 3rd International Conference on
Conference_Location
Chengdu, China
Print_ISBN
978-1-4244-7384-7
Electronic_ISBN
978-1-4244-7386-1
Type
conf
DOI
10.1109/ICICIS.2010.5534784
Filename
5534784
Link To Document