• DocumentCode
    3059424
  • Title

    Improving the Web text content by extracting significant pages into a Web site

  • Author

    Rios, Sebastián A. ; Velásquez, Juan D. ; Vera, Eduardo S. ; Yasuda, Hiroshi ; Aoki, Terumasa

  • Author_Institution
    Res. Center for Adv. Sci. & Technol., Tokyo Univ., Japan
  • fYear
    2005
  • fDate
    8-10 Sept. 2005
  • Firstpage
    32
  • Lastpage
    36
  • Abstract
    Web systems have reached a very important role in today´s business world. Every day organizations fight to keep their present clients and to gain new ones. In order to accomplish this goal it is very important to make precise changes in the Web site content. However, the development of these improvements is a complex and specialized task because of the nature of the Web data itself. We propose a novel approach to successfully make changes to improve the Web site content using text mining. We use a self organizing feature map (SOFM) to find the most relevant text content, and then we propose a reverse clustering analysis in order to extract the most significant pages of the whole Web site. The effectiveness of this method was experimentally tested in a real Web site.
  • Keywords
    Web sites; data mining; self-organising feature maps; Web site content; Web text content; reverse clustering analysis; self organizing feature map; text mining; Collaboration; Computer science; Continuous improvement; Data mining; Industrial engineering; Organizing; Testing; Text mining; Web page design; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2005. ISDA '05. Proceedings. 5th International Conference on
  • Print_ISBN
    0-7695-2286-6
  • Type

    conf

  • DOI
    10.1109/ISDA.2005.55
  • Filename
    1578756