DocumentCode
3732071
Title
How to Perform Incremental Clustering - A SOM Based View
Author
Chen Lei;Wu Chong
Author_Institution
Sch. of Manage., Harbin Inst. of Technol., Harbin, China
fYear
2015
Firstpage
450
Lastpage
455
Abstract
Due to the rapid development of network technology, internet users face massive amounts of textual data every day. Because clustering is unsupervised, it is a good way for users to analyze and organize texts into categories. However, most recent clustering algorithms operate in a static setting; that is, they cannot handle novel data efficiently. When novel data appear, traditional clustering algorithms cannot easily change their structure. This restriction is ill-suited to the internet, where novel data appear at any time. For this reason, this paper proposes an incremental clustering algorithm for clustering incremental data. The algorithm has two components. (a) It designs two measures of a feature's ability and integrates them into the similarity measurement, replacing co-occurrence-based similarity measurements. (b) Based on the proposed similarity measurement, the algorithm selects a few samples from the original texts to perform incremental clustering. Experimental results demonstrate that, after integrating the feature's capacity, our algorithm clusters texts with high quality.
Keywords
"Transportation","Big data","Smart cities"
Publisher
ieee
Conference_Titel
Intelligent Transportation, Big Data and Smart City (ICITBS), 2015 International Conference on
Type
conf
DOI
10.1109/ICITBS.2015.117
Filename
7384063
Link To Document