Title :
Clustering sequential data into hierarchical patterns
Author :
Xinying Song;Johnson Apacible
Author_Institution :
School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
fDate :
5/1/2011 12:00:00 AM
Abstract :
Sequential data, i.e. text string, is a common yet important data type. Automatically discovering patterns for sequential data is useful but challenging. In this paper, we address this task by clustering strings into hierarchical patterns. Such pattern hierarchy is particularly helpful for users to discover meaningful patterns as well as to interpret the encapsulated knowledge. We present the clustering algorithm in details and evaluate it on a large, real dataset of street addresses. The experiments demonstrate the effectiveness of our approach, making it a useful tool for analyzing and interpreting sequential data.
Keywords :
Pattern matching
Conference_Titel :
Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
Print_ISBN :
978-1-61284-485-5
DOI :
10.1109/ICCSN.2011.6014411