Exploiting structure for intelligent Web search

Author

Kruschwitz, Udo

Author_Institution

Dept. of Comput. Sci., Essex Univ., Colchester, UK

fYear

2001

fDate

6-6 Jan. 2001

Abstract

Together with the rapidly growing amount of online data, we register an immense need for intelligent search engines that access a restricted amount of data as found in intranets or other limited domains. These sorts of search engine must go beyond simple keyword indexing/matching, but they also have to be easily adaptable to new domains without huge costs. The paper presents a mechanism that addresses both of these points: first of all, the internal document structure is being used to extract concepts which impose a directory-like structure on the documents, similar to those found in classified directories. Furthermore, this is done in an efficient way which is largely language independent and does not make assumptions about the document structure.

Keywords

document handling; human factors; information resources; interactive systems; knowledge based systems; search engines; classified directories; directory-like structure; document structure; intelligent Web search; intelligent search engines; internal document structure; intranets; language independence; online data; simple keyword indexing/matching; Computer science; Costs; Data mining; Indexing; Intelligent structures; Pattern matching; Registers; Search engines; Web pages; Web search;

fLanguage

English

Publisher

ieee

Conference_Titel

System Sciences, 2001. Proceedings of the 34th Annual Hawaii International Conference on

Conference_Location

Maui, HI, USA

Print_ISBN

0-7695-0981-9

Type

conf

DOI

10.1109/HICSS.2001.926474

Filename

926474