• DocumentCode
    717998
  • Title

    Clustering improvement via integrating with sparse topical coding

  • Author

    Ahmadi, Parvin ; Kaviani, Razie ; Gholampour, Iman ; Tabandeh, Mahmoud

  • Author_Institution
    Dept. of Electr. Eng., Sharif Univ. of Technol., Tehran, Iran
  • fYear
    2015
  • fDate
    10-14 May 2015
  • Firstpage
    466
  • Lastpage
    471
  • Abstract
    Topic modeling can improve document clustering by projecting documents into a topic space. By document, we mean a general concept. Document can be an image, a video, a textual document or each data which can be described in bag-of-words model based on the histogram of its features. In this paper, we introduce a clustering method based on Sparse Topical Coding (STC). In the proposed method, document clustering and topic modeling are integrated into a unified framework and jointly performed to achieve the best clustering performance. Our method clusters the documents based on STC topic modeling used for mining the topics and K-means clustering used for discovering latent groups in document collection. Experimental results show the effectiveness of our proposed clustering approach.
  • Keywords
    compressed sensing; document image processing; image coding; pattern clustering; text analysis; video signal processing; visual databases; word processing; K-means clustering; STC topic modeling; bag-of-words model; clustering improvement; document clustering; document collection; sparse topical coding; textual document; topic space; Conferences; Decision support systems; Electrical engineering; Document clustering; K-means; Sparse Topical Coding (STC); topic model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering (ICEE), 2015 23rd Iranian Conference on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4799-1971-0
  • Type

    conf

  • DOI
    10.1109/IranianCEE.2015.7146260
  • Filename
    7146260