• DocumentCode
    710121
  • Title

    Window-chained longest common subsequence: Common event matching in sequences

  • Author

    Chunyao Song ; Tingjian Ge

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Massachusetts, Lowell, MA, USA
  • fYear
    2015
  • fDate
    13-17 April 2015
  • Firstpage
    759
  • Lastpage
    770
  • Abstract
    Sequence data is prevalent, and event processing over sequences is increasingly important in this Big Data era, drawing much attention from both research and industry. In this paper, we address a novel problem, which is to find common event subsequences from two long sequences. This problem is well motivated, with applications in diverse domains. We propose the window-chained longest common subsequence (WCLCS) semantics, and argue that the traditional longest common subsequence (LCS) cannot serve this need. We then devise efficient algorithms to solve this problem by reducing it to a graph problem. We also propose two more methods to improve the performance: one is based on informed search and exploration, and the other is an approximation algorithm with accuracy guarantees. We finally carry out a systematic experimental evaluation using two real-world datasets and some synthetic datasets.
  • Keywords
    Big Data; approximation theory; graph theory; sequences; Big Data; WCLCS semantics; approximation algorithm; common event matching; event processing; event subsequences; graph problem; real-world datasets; sequence data; synthetic datasets; systematic experimental evaluation; window-chained longest-common subsequence semantics; Algorithm design and analysis; Approximation algorithms; Companies; Diabetes; Heuristic algorithms; Semantics; Sensors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2015 IEEE 31st International Conference on
  • Conference_Location
    Seoul
  • Type

    conf

  • DOI
    10.1109/ICDE.2015.7113331
  • Filename
    7113331