Title :
A Fuzzy Approach to Clustering of Text Documents Based on MapReduce
Author :
Hu Zongzhen ; Zhu Weina ; Li Yu E ; Du Xiaojuan ; Yan Fan
Author_Institution :
Dept. of Comput. Sci., Yunnan Univ., Kunming, China
Abstract :
This paper discusses text clustering based on a parallel computing platform called Hadoop. According to the concept of fuzzy set, this paper presents a fuzzy clustering approach for document categorization. Furthermore, a parallel text clustered framework based on MapReduce was designed according to the proposed text clustering procedure.
Keywords :
fuzzy set theory; parallel programming; pattern clustering; text analysis; Hadoop parallel computing platform; MapReduce; document categorization; fuzzy approach; fuzzy clustering approach; fuzzy set concept; text clustering procedure; text document clustering; Algorithm design and analysis; Clustering algorithms; Data mining; Educational institutions; Information entropy; Programming; Training; Distributed computing; Fuzzy approach; Hadoop; MapReduce; Parallel computing; Text document clustering;
Conference_Titel :
Computational and Information Sciences (ICCIS), 2013 Fifth International Conference on
Conference_Location :
Shiyang
DOI :
10.1109/ICCIS.2013.181