• DocumentCode
    3091772
  • Title

    The categorisation of hidden Web databases through concept specificity and coverage

  • Author

    Hedley, Yih-Ling ; Younas, Muhammad ; James, Anne

  • Author_Institution
    Sch. of Math. & Inf. Sci., Coventry Univ., UK
  • Volume
    2
  • fYear
    2005
  • fDate
    28-30 March 2005
  • Firstpage
    671
  • Abstract
    Hidden Web databases maintain a collection of specialised documents, which are dynamically generated in response to users´ queries. The categorisation of such databases into a set of predefined categories has been widely employed to assist users in their information searches. In this paper we present a technique that automatically categorises a document database through its content summary and concepts described by their specificity and coverage. Experimental results show that our approach categorises databases with a larger number of relevant categories.
  • Keywords
    classification; content management; content-based retrieval; document handling; information retrieval; information retrieval systems; concept coverage; concept specificity; content summary; document database; hidden Web databases; information search; Data mining; Databases; Frequency; Information retrieval; Sampling methods; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications, 2005. AINA 2005. 19th International Conference on
  • ISSN
    1550-445X
  • Print_ISBN
    0-7695-2249-1
  • Type

    conf

  • DOI
    10.1109/AINA.2005.323
  • Filename
    1423772