• DocumentCode
    1004479
  • Title

    Classifying Web pages employing a probabilistic neural network

  • Author

    Anagnostopoulos, I. ; Anagnostopoulos, C. ; Loumos, V. ; Kayafas, E.

  • Author_Institution
    Dept. of Commun., Electron. & Inf. Syst., Nat. Tech. Univ. of Athens, Greece
  • Volume
    151
  • Issue
    3
  • fYear
    2004
  • fDate
    6/7/2004 12:00:00 AM
  • Firstpage
    139
  • Lastpage
    150
  • Abstract
    The paper proposes a system capable of identifying and categorising Web pages on the basis of information filtering. The system is a three-layer probabilistic neural network (PNN) with biases and radial basis neurons in the middle layer and competitive neurons in the output layer. The domain of study involves the e-commerce area. Thus, the PNN scopes to identify e-commerce Web pages and classify them to the respective type according to a framework which describes the fundamental phases of commercial transactions in the Web. The system was tested with many types of Web pages, demonstrating the robustness of the method, since no restrictions were imposed except for the language of the content, which is English. The probabilistic classifier was used for estimating the population of specific e-commerce Web pages. Potential applications involve surveying Web activity in commercial servers, as well as Web page classification in largely expanding information areas like e-government or news and media.
  • Keywords
    Web sites; classification; electronic commerce; information retrieval; learning (artificial intelligence); radial basis function networks; uncertainty handling; Web page classification; competitive neurons; e-commerce; information filtering; probabilistic classifier; probabilistic neural network; radial basis neurons; training neural network;
  • fLanguage
    English
  • Journal_Title
    Software, IEE Proceedings -
  • Publisher
    iet
  • ISSN
    1462-5970
  • Type

    jour

  • DOI
    10.1049/ip-sen:20040121
  • Filename
    1304277