Title :
Exploiting structure for intelligent Web search
Author_Institution :
Dept. of Comput. Sci., Essex Univ., Colchester, UK
Abstract :
As the amount of online data grows rapidly, there is a pressing need for intelligent search engines that access a restricted amount of data, as found in intranets or other limited domains. Such search engines must go beyond simple keyword indexing and matching, yet they must also be easily adaptable to new domains without huge costs. The paper presents a mechanism that addresses both of these points. First, the internal document structure is used to extract concepts that impose a directory-like structure on the documents, similar to that found in classified directories. Second, this is done in an efficient way that is largely language independent and does not make assumptions about the document structure.
Keywords :
document handling; human factors; information resources; interactive systems; knowledge based systems; search engines; classified directories; directory-like structure; document structure; intelligent Web search; intelligent search engines; internal document structure; intranets; language independence; online data; simple keyword indexing/matching; Computer science; Costs; Data mining; Indexing; Intelligent structures; Pattern matching; Registers; Search engines; Web pages; Web search;
Conference_Title :
System Sciences, 2001. Proceedings of the 34th Annual Hawaii International Conference on
Conference_Location :
Maui, HI, USA
Print_ISBN :
0-7695-0981-9
DOI :
10.1109/HICSS.2001.926474