• DocumentCode
    3202679
  • Title

    Web Page Clustering Based on Searching Keywords

  • Author

    Li, Taoying ; Chen, Yan

  • Author_Institution
    Transp. Manage. Coll., Dalian Maritime Univ., Dalian, China
  • Volume
    3
  • fYear
    2010
  • fDate
    11-12 May 2010
  • Firstpage
    1163
  • Lastpage
    1166
  • Abstract
    In order to improve searching results of Web pages and enhancing Web crawling operation, the Web page clustering based on searching keywords is proposed in this paper, which firstly employed matching degree between Web pages and searching keywords to decide the sequence of showing pages of searching results. Then clustering algorithm was chosen to group pages of searching results according to matching degree. Next we used duplicated pages deletion to detect and remove duplicated pages with same titles and abstracts. Finally, the proposed algorithm is applied in practice and results show that it is effective and feasible for solving information explosion on Web.
  • Keywords
    Internet; data mining; pattern clustering; Web crawling operation; Web page clustering; duplicated pages deletion; matching degree; searching keywords; Automation; Clustering algorithms; Couplings; Data mining; Explosions; Partitioning algorithms; Transportation; Web mining; Web pages; Web services; matching degree; searching degree; web clustering; web mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computation Technology and Automation (ICICTA), 2010 International Conference on
  • Conference_Location
    Changsha
  • Print_ISBN
    978-1-4244-7279-6
  • Electronic_ISBN
    978-1-4244-7280-2
  • Type

    conf

  • DOI
    10.1109/ICICTA.2010.53
  • Filename
    5523220