DocumentCode :
2614024
Title :
WebReader: a mechanism for automating the search and collecting information from the World Wide Web
Author :
Chen, J.C.Y. ; Li, Qing
Author_Institution :
Dept. of Tech. Dev., Reuters Asia Pte Ltd., China
Volume :
2
fYear :
2000
fDate :
2000
Firstpage :
47
Abstract :
Current Web search engines are based on keyword search, and the relevance of a Web page depends on the hit count for the keywords. As keyword matching is not at the same level as semantic matching, the search scope is unnecessarily broad and the precision (and recall) can be rather low. These problems give rise to undesirable performance in Web information searching. In this paper, we describe a mechanism called WebReader, a middleware between the browser and the Web for automating the search and collection of information from the Web. By facilitating meta-data specification in XML and manipulation in XSL, WebReader provides users with a centralized, structured, and categorized means to specify and retrieve Web information. An experimental prototype based on XML, XSL and Java has been developed to show the feasibility and practicality of our approach through a real-life application example.
Keywords :
client-server systems; search engines; Web search engines; WebReader; meta-data specification; middleware; search automation; Asia; Databases; Degradation; Information filtering; Information filters; Keyword search; Search engines; Web pages; Web sites; XML;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Information Systems Engineering, 2000. Proceedings of the First International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-0577-5
Type :
conf
DOI :
10.1109/WISE.2000.882853
Filename :
882853