• DocumentCode
    3401301
  • Title

    Generic multi-document summarization using cluster refinement and NMF

  • Author

    Park, Sun ; An, Dong Un ; Cho, Youn Jeong

  • Author_Institution
    Adv. Grad. Educ. Center of Jeonbuk for Electron. & Inf. Technol.-BK21, Chonbuk Nat. Univ., Jeonju, South Korea
  • fYear
    2009
  • fDate
    14-17 Dec. 2009
  • Firstpage
    65
  • Lastpage
    70
  • Abstract
    In this paper, a generic summarization method that uses cluster refinement and NMF is introduced to extract meaningful sentences from documents. The proposed method uses cluster refinement to improve the quality of document clustering since it helps us to remove dissimilarity information easily and avoid biased inherent semantics of documents to be reflected in clusters by NMF. In addition, it uses the weighted semantic variable to select meaningful sentences because the extracted sentences are well covered with the major topics of document. The experimental results demonstrate that the proposed method has better performance than other methods that use the other methods.
  • Keywords
    feature extraction; pattern clustering; text analysis; cluster refinement; document clustering; generic multidocument summarization; meaningful sentence extraction; Clustering algorithms; Data mining; Diversity methods; Diversity reception; Educational technology; Feature extraction; Internet; Sun; Vectors; NMF; cluster coherence; cluster refinement; geneirc multi-document summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Information Technology (ISSPIT), 2009 IEEE International Symposium on
  • Conference_Location
    Ajman
  • Print_ISBN
    978-1-4244-5949-0
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2009.5407492
  • Filename
    5407492