DocumentCode :
3695386
Title :
A semi-supervised fuzzy co-clustering framework and application to twitter data analysis
Author :
Katsuhiro Honda;Seiki Ubukata;Akira Notsu;Norimitsu Takahashi;Yutaka Ishikawa
Author_Institution :
Graduate School of Engineering, Osaka Prefecture University, Sakai, 599-8531 Japan
fYear :
2015
fDate :
6/1/2015 12:00:00 AM
Firstpage :
1
Lastpage :
4
Abstract :
Semi-supervised clustering is an efficient scheme for utilizing data with partial class information, where unsupervised data distributions are estimated under some supports of partial supervised class information. In this paper, a novel framework for performing fuzzy co-clustering of cooccurrence information with partial supervision is proposed, which is induced by multinomial mixture concept. Co-clustering is useful for extracting object-item pair-wise clusters from cooccurrence information and has been utilized in various applications such as document-keyword analysis and customer-products purchase history data analysis. Several experimental results including a twitter data analysis demonstrate the ability of improving the classification quality of the fuzzified co-cluster structural knowledge. Then, the proposed semi-supervised framework is expected to be a powerful tool in Big Data analysis with huge volumes of data but partial supervisions only.
Keywords :
"Training","Twitter","Supervised learning","Data analysis","Estimation","Mixture models","Linear programming"
Publisher :
ieee
Conference_Titel :
Informatics, Electronics & Vision (ICIEV), 2015 International Conference on
Type :
conf
DOI :
10.1109/ICIEV.2015.7334057
Filename :
7334057
Link To Document :
بازگشت