Title :
Automatic information discovery from the "invisible Web"
Author :
Lin, King-Ip ; Chen, Hui
Author_Institution :
Div. of Comput. Sci., Univ. of Memphis, TN, USA
Abstract :
A large amount of online information resides on the "invisible Web": Web pages that are generated dynamically from databases and other data sources hidden from the user. These pages are not indexed by a static URL but are generated when queries are made via a search interface (a specialized search engine). In this paper, we propose a system that is capable of automatically making use of these specialized engines to find information on the invisible Web. We describe our overall architecture and process, from obtaining the search engines to picking the right engines to query. Experiments show that we can find information that is not found by traditional search engines.
Keywords :
data mining; information resources; search engines; World Wide Web; automatic information discovery; databases; dynamically generated Web pages; hidden data sources; information finding; invisible Web; online information; querying; search interface; specialized search engines; unindexed Web pages; Computer science; Databases; Encyclopedias; HTML; Indexes; Information retrieval; Search engines; Uniform resource locators; Web pages; Web sites
Conference_Title :
Information Technology: Coding and Computing, 2002. Proceedings. International Conference on
Print_ISBN :
0-7695-1506-1
DOI :
10.1109/ITCC.2002.1000411