Title :
Ontology-based automatic classification for Web pages: design, implementation and evaluation
Author :
Prabowo, Rudy ; Jackson, Mike ; Burden, Peter ; Knoell, Heinz-Dieter
Author_Institution :
Sch. of Comput. & Inf. Technol., Univ. of Wolverhampton, UK
Abstract :
In recent years, we have witnessed continual growth in the use of ontologies as a mechanism to enable machine reasoning. This paper describes an automatic classifier that uses ontologies to classify Web pages with respect to the Dewey Decimal Classification (DDC) and Library of Congress Classification (LCC) schemes. Firstly, we explain how these ontologies can be built in a modular fashion and mapped into DDC and LCC. Secondly, we propose formal definitions of a DDC-LCC mapping and an ontology-classification-scheme mapping. Thirdly, we explain how the classifier uses these ontologies to assist classification. Finally, we present an experiment in which the accuracy of the classifier was evaluated. The experiment shows that our approach results in improved classification accuracy. This improvement, however, comes at the cost of a low coverage ratio, due to the incompleteness of the ontologies used.
Keywords :
Internet; classification; inference mechanisms; information resources; information retrieval; knowledge representation; Dewey Decimal Classification; Library of Congress Classification; Web pages; accuracy; machine reasoning; ontology-based automatic classification; ontology-classification-scheme mapping; Costs; Data mining; Libraries; Ontologies; Portals; Seals; Search engines; Terminology
Conference_Title :
Web Information Systems Engineering, 2002. WISE 2002. Proceedings of the Third International Conference on
Print_ISBN :
0-7695-1766-8
DOI :
10.1109/WISE.2002.1181655