DocumentCode
610072
Title
Compressed Parameterized Pattern Matching
Author
Beal, R. ; Adjeroh, Donald A.
Author_Institution
Lane Dept. of Comput. Sci. & Electr. Eng., West Virginia Univ., Morgantown, WV, USA
fYear
2013
fDate
20-22 March 2013
Firstpage
461
Lastpage
470
Abstract
Traditional pattern matching between strings, from the alphabet Σ, is well defined for both uncompressed and compressed sequences. Prior to this work, parameterized pattern matching (p-matching) was defined predominately by the matching between uncompressed parameterized strings (p-strings) from the constant alphabet Σ and the parameter alphabet II. In this work, we define the compressed parameterized pattern matching (compressed p-matching) problem to find all of the p-matches between a pattern P and text T, using only P and the compressed text Tc. Initially, we present parameterized compression (p-compression) as a new way to losslessly compress data to support p-matching. Experimentally, we show that p-compression is competitive with various other standard compression schemes. Subsequently, we provide the compression and decompression algorithms. Using p-compression, we address the compressed p-matching problem. Our general solution is independent of the underlying compression scheme. The results are further examined for the specific case of Tunstall codes.
Keywords
data compression; image coding; image matching; image sequences; Tunstall code; compressed p-matching problem; compressed parameterized pattern matching problem; constant alphabet Σ; decompression algorithm; losslessly data compression; p-string; parameter alphabet Π; standard compression scheme; uncompressed parameterized string; uncompressed sequence; Arrays; Biological information theory; Dictionaries; Encoding; Pattern matching; Plagiarism; Standards; Tunstall codes; compressed pattern matching; lossless compression; p-border; p-match; p-string; parameterized border; parameterized matching; parameterized string;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference (DCC), 2013
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
978-1-4673-6037-1
Type
conf
DOI
10.1109/DCC.2013.54
Filename
6543082
Link To Document