• DocumentCode
    610072
  • Title

    Compressed Parameterized Pattern Matching

  • Author

    Beal, R. ; Adjeroh, Donald A.

  • Author_Institution
    Lane Dept. of Comput. Sci. & Electr. Eng., West Virginia Univ., Morgantown, WV, USA
  • fYear
    2013
  • fDate
    20-22 March 2013
  • Firstpage
    461
  • Lastpage
    470
  • Abstract
    Traditional pattern matching between strings, from the alphabet Σ, is well defined for both uncompressed and compressed sequences. Prior to this work, parameterized pattern matching (p-matching) was defined predominately by the matching between uncompressed parameterized strings (p-strings) from the constant alphabet Σ and the parameter alphabet II. In this work, we define the compressed parameterized pattern matching (compressed p-matching) problem to find all of the p-matches between a pattern P and text T, using only P and the compressed text Tc. Initially, we present parameterized compression (p-compression) as a new way to losslessly compress data to support p-matching. Experimentally, we show that p-compression is competitive with various other standard compression schemes. Subsequently, we provide the compression and decompression algorithms. Using p-compression, we address the compressed p-matching problem. Our general solution is independent of the underlying compression scheme. The results are further examined for the specific case of Tunstall codes.
  • Keywords
    data compression; image coding; image matching; image sequences; Tunstall code; compressed p-matching problem; compressed parameterized pattern matching problem; constant alphabet Σ; decompression algorithm; losslessly data compression; p-string; parameter alphabet Π; standard compression scheme; uncompressed parameterized string; uncompressed sequence; Arrays; Biological information theory; Dictionaries; Encoding; Pattern matching; Plagiarism; Standards; Tunstall codes; compressed pattern matching; lossless compression; p-border; p-match; p-string; parameterized border; parameterized matching; parameterized string;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference (DCC), 2013
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    978-1-4673-6037-1
  • Type

    conf

  • DOI
    10.1109/DCC.2013.54
  • Filename
    6543082