DocumentCode :
178102
Title :
Boosted Multi-modal Supervised Latent Dirichlet Allocation for Social Event Classification
Author :
Shengsheng Qian ; Tianzhu Zhang ; Changsheng Xu
Author_Institution :
Inst. of Autom., Beijing, China
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
1999
Lastpage :
2004
Abstract :
With the rapidly increasing popularity of Social Media sites (e.g., Flickr, YouTube, and Facebook), it is convenient for users to share their own comments on many social events, which successfully facilitates social event generation, sharing and propagation and results in a large amount of user-contributed media data (e.g., images, videos, and texts) for a wide variety of real-world events of different types and scales. As a consequence, it has become more and more difficult to find exactly the interesting events from massive social media data, which is useful to browse, search and monitor social events by users or governments. To deal with these issues, we propose a novel boosted multi-modal supervised Latent Dirichlet Allocation (BMM-SLDA) for social event classification. Our BMM-SLDA has a number of advantages. (1) It can effectively exploit the multi-modality and the supervised information of social events jointly. (2) It is suitable to large-scale data analysis by utilizing boosting weighted sampling strategy to iteratively select a small subset data to efficiently train the corresponding topic models. (3) It effectively exploits boosting document weight distribution by classification error, and can iteratively learn new topic model to correct the previously misclassified documents. We evaluate our BMM-SLDA on a real-world dataset and show extensive results, which show that our model outperforms state-of-the-art methods.
Keywords :
data analysis; social networking (online); BMM-SLDA; Facebook; Flickr; YouTube; boosted multimodal supervised latent Dirichlet allocation; classification error; document weight distribution; large-scale data analysis; massive social media data; real-world dataset; social event classification; social media sites; social propagation; social sharing; topic models; user-contributed media data; weighted sampling strategy; Analytical models; Boosting; Media; Resource management; Training; Training data; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.349
Filename :
6977061
Link To Document :
بازگشت