DocumentCode :
3643428
Title :
Clustering sequential data into hierarchical patterns
Author :
Xinying Song;Johnson Apacible
Author_Institution :
School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
154
Lastpage :
158
Abstract :
Sequential data, i.e. text string, is a common yet important data type. Automatically discovering patterns for sequential data is useful but challenging. In this paper, we address this task by clustering strings into hierarchical patterns. Such pattern hierarchy is particularly helpful for users to discover meaningful patterns as well as to interpret the encapsulated knowledge. We present the clustering algorithm in details and evaluate it on a large, real dataset of street addresses. The experiments demonstrate the effectiveness of our approach, making it a useful tool for analyzing and interpreting sequential data.
Keywords :
Pattern matching
Publisher :
ieee
Conference_Titel :
Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
Print_ISBN :
978-1-61284-485-5
Type :
conf
DOI :
10.1109/ICCSN.2011.6014411
Filename :
6014411
Link To Document :
بازگشت