Title :
Window-chained longest common subsequence: Common event matching in sequences
Author :
Chunyao Song ; Tingjian Ge
Author_Institution :
Dept. of Comput. Sci., Univ. of Massachusetts, Lowell, MA, USA
Abstract :
Sequence data is prevalent, and event processing over sequences is increasingly important in this Big Data era, drawing much attention from both research and industry. In this paper, we address a novel problem, which is to find common event subsequences from two long sequences. This problem is well motivated, with applications in diverse domains. We propose the window-chained longest common subsequence (WCLCS) semantics, and argue that the traditional longest common subsequence (LCS) cannot serve this need. We then devise efficient algorithms to solve this problem by reducing it to a graph problem. We also propose two more methods to improve the performance: one is based on informed search and exploration, and the other is an approximation algorithm with accuracy guarantees. We finally carry out a systematic experimental evaluation using two real-world datasets and some synthetic datasets.
Keywords :
Big Data; approximation theory; graph theory; sequences; Big Data; WCLCS semantics; approximation algorithm; common event matching; event processing; event subsequences; graph problem; real-world datasets; sequence data; synthetic datasets; systematic experimental evaluation; window-chained longest-common subsequence semantics; Algorithm design and analysis; Approximation algorithms; Companies; Diabetes; Heuristic algorithms; Semantics; Sensors;
Conference_Titel :
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICDE.2015.7113331