• DocumentCode
    3583928
  • Title

    Literature Characterization and Similarity Retrieval Based on Hierarchical Clustering

  • Author

    Li, Peng

  • Author_Institution
    Sch. of Inf., Linyi Normal Univ., Linyi, China
  • Volume
    1
  • fYear
    2009
  • Firstpage
    397
  • Lastpage
    400
  • Abstract
    The growing number of literature in journals database raises a new and challenging search problem: locating desired literature. Traditional keyword search is insufficient: the specific literature users require is possibly not captured. We introduce a new algorithm of hierarchical clustering. With this algorithm, we cluster the keywords into a concept tree, then we turn every literature into an induced tree. We propose a new method for theses retrieval, which based on concept similarity. This method improves in recall and precision.
  • Keywords
    information retrieval; literature; concept tree; hierarchical clustering; journals database; keywords; literature characterization; similarity retrieval; Association rules; Binary trees; Clustering algorithms; Databases; Filtering algorithms; Information retrieval; Keyword search; Search engines; Search problems; Software engineering; Hierarchical clustering; Journals database; Literature Characterization; Similarity search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, 2009. WCSE '09. WRI World Congress on
  • Print_ISBN
    978-0-7695-3570-8
  • Type

    conf

  • DOI
    10.1109/WCSE.2009.124
  • Filename
    5319138