Title :
A Novel Co-clustering Method with Intra-similarities
Author :
Wu, Jian-Sheng ; Lai, Jian-Huang ; Wang, Chang-Dong
Author_Institution :
Sch. of Inf. Sci. & Technol., Sun Yat-sen Univ., Guangzhou, China
Abstract :
Recently, co-clustering has become a topic of much interest because of its applications to many problems. It has been proved more effective than one-way clustering methods. But the existing co-clustering approaches just treat the document as a collection of words, disregarding the word sequences. They only consider the co-occurrence counts of words and documents, but do not take into account the similarities between words and similarities between documents. However, these similarity information can help improving the co-clustering. In this paper, we incorporate the word similarities and document similarities into the co-clustering algorithm, and propose a new co-clustering method. And we provide a theoretical analysis that our algorithm can converge to a local minimum. The empirical evaluation on publicly available data sets also shows that our algorithm is effective.
Keywords :
pattern clustering; text analysis; coclustering method; cooccurrence word count; document clustering; document similarity; similarity information; word collection; word sequence; word similarity; Algorithm design and analysis; Clustering algorithms; Decision support systems; Joints; Mutual information; Partitioning algorithms; Prototypes; co-clustering; document similarities; word similarities;
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
DOI :
10.1109/ICDMW.2011.15