Title :
Clustering improvement via integrating with sparse topical coding
Author :
Ahmadi, Parvin ; Kaviani, Razie ; Gholampour, Iman ; Tabandeh, Mahmoud
Author_Institution :
Dept. of Electr. Eng., Sharif Univ. of Technol., Tehran, Iran
Abstract :
Topic modeling can improve document clustering by projecting documents into a topic space. By document, we mean a general concept. Document can be an image, a video, a textual document or each data which can be described in bag-of-words model based on the histogram of its features. In this paper, we introduce a clustering method based on Sparse Topical Coding (STC). In the proposed method, document clustering and topic modeling are integrated into a unified framework and jointly performed to achieve the best clustering performance. Our method clusters the documents based on STC topic modeling used for mining the topics and K-means clustering used for discovering latent groups in document collection. Experimental results show the effectiveness of our proposed clustering approach.
Keywords :
compressed sensing; document image processing; image coding; pattern clustering; text analysis; video signal processing; visual databases; word processing; K-means clustering; STC topic modeling; bag-of-words model; clustering improvement; document clustering; document collection; sparse topical coding; textual document; topic space; Conferences; Decision support systems; Electrical engineering; Document clustering; K-means; Sparse Topical Coding (STC); topic model;
Conference_Titel :
Electrical Engineering (ICEE), 2015 23rd Iranian Conference on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-1971-0
DOI :
10.1109/IranianCEE.2015.7146260