DocumentCode
1004479
Title
Classifying Web pages employing a probabilistic neural network
Author
Anagnostopoulos, I. ; Anagnostopoulos, C. ; Loumos, V. ; Kayafas, E.
Author_Institution
Dept. of Commun., Electron. & Inf. Syst., Nat. Tech. Univ. of Athens, Greece
Volume
151
Issue
3
fYear
2004
fDate
6/7/2004 12:00:00 AM
Firstpage
139
Lastpage
150
Abstract
The paper proposes a system capable of identifying and categorising Web pages on the basis of information filtering. The system is a three-layer probabilistic neural network (PNN) with biases and radial basis neurons in the middle layer and competitive neurons in the output layer. The domain of study involves the e-commerce area. Thus, the PNN scopes to identify e-commerce Web pages and classify them to the respective type according to a framework which describes the fundamental phases of commercial transactions in the Web. The system was tested with many types of Web pages, demonstrating the robustness of the method, since no restrictions were imposed except for the language of the content, which is English. The probabilistic classifier was used for estimating the population of specific e-commerce Web pages. Potential applications involve surveying Web activity in commercial servers, as well as Web page classification in largely expanding information areas like e-government or news and media.
Keywords
Web sites; classification; electronic commerce; information retrieval; learning (artificial intelligence); radial basis function networks; uncertainty handling; Web page classification; competitive neurons; e-commerce; information filtering; probabilistic classifier; probabilistic neural network; radial basis neurons; training neural network;
fLanguage
English
Journal_Title
Software, IEE Proceedings -
Publisher
iet
ISSN
1462-5970
Type
jour
DOI
10.1049/ip-sen:20040121
Filename
1304277
Link To Document