• DocumentCode
    1963743
  • Title

    Research on Automatic Classification for Deep Web Query Interfaces

  • Author

    Lin, Peiguang ; Du, Yibing ; Tan, Xiaohua ; LV, Chao

  • Author_Institution
    Sch. of Comput. & Inf. Eng., Shandong Univ. of Finance, Jinan
  • fYear
    2008
  • fDate
    23-25 May 2008
  • Firstpage
    313
  • Lastpage
    317
  • Abstract
    In recent years, the Web is "deepened" rapidly and users have to browse quantities of Web sites to access Web databases in a specific domain. So, to build an unified query interface which integrates query interfaces of a domain to access various Web databases at the same time becomes a very important issue. In this paper, the schema characteristics of query interfaces and common attributes in a same domain are firstly analyzed, and it also gives a new representation of query interface, then the definition of "Form term" and "Function term" are proposed ,and a new similarity computing algorithm, literal and semantic based similarity computing (LSSC) is proposed, which is based on the two definitions. Secondly, a clustering algorithm for Deep Web query interfaces is given by combining LSSC and NQ algorithm: LSSC-NQ. Finally, experiments show that this algorithm can give accurate similarity computing, and cluster query interfaces efficiently, reliably and quickly.
  • Keywords
    Internet; classification; query processing; user interfaces; Web database access; Web site browsing; automatic classification; deep Web query interface clustering; literal similarity computing algorithm; semantic based similarity computing algorithm; unified query interface representation; Algorithm design and analysis; Chaos; Clustering algorithms; Clustering methods; Computer interfaces; Concrete; Data engineering; Databases; Finance; Information processing; Automatic Classification; Deep Web; Query Interface;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Processing (ISIP), 2008 International Symposiums on
  • Conference_Location
    Moscow
  • Print_ISBN
    978-0-7695-3151-9
  • Type

    conf

  • DOI
    10.1109/ISIP.2008.140
  • Filename
    4554105