• DocumentCode
    2625865
  • Title
    Clustering XML Documents Based on the Weight of Frequent Structures
  • Author
    Hwang, Jeong Hee ; Gu, Mi Sug
  • Author_Institution
    Namseoul Univ., Chonan
  • fYear
    2007
  • fDate
    21-23 Nov. 2007
  • Firstpage
    845
  • Lastpage
    849
  • Abstract
    Previous clustering methods for XML documents group documents with similar structures by measuring the structural similarity and distance between them. In this paper, we instead propose a novel clustering method that uses the weight of frequent structures in XML documents, treating each XML document as a transaction and the structures extracted from it as the items of that transaction. Our experimental results show that the method runs quickly and produces cohesive clusters.
  • Keywords
    XML; document handling; XML documents; clustering methods; structural similarity; Bioinformatics; Books; Clustering algorithms; Computer science; Databases; Information technology; Internet; Laboratories
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Convergence Information Technology, 2007. International Conference on
  • Conference_Location
    Gyeongju
  • Print_ISBN
    0-7695-3038-9
  • Type
    conf
  • DOI
    10.1109/ICCIT.2007.101
  • Filename
    4420365