DocumentCode :
610072
Title :
Compressed Parameterized Pattern Matching
Author :
Beal, R. ; Adjeroh, Donald A.
Author_Institution :
Lane Dept. of Comput. Sci. & Electr. Eng., West Virginia Univ., Morgantown, WV, USA
fYear :
2013
fDate :
20-22 March 2013
Firstpage :
461
Lastpage :
470
Abstract :
Traditional pattern matching between strings, from the alphabet Σ, is well defined for both uncompressed and compressed sequences. Prior to this work, parameterized pattern matching (p-matching) was defined predominately by the matching between uncompressed parameterized strings (p-strings) from the constant alphabet Σ and the parameter alphabet II. In this work, we define the compressed parameterized pattern matching (compressed p-matching) problem to find all of the p-matches between a pattern P and text T, using only P and the compressed text Tc. Initially, we present parameterized compression (p-compression) as a new way to losslessly compress data to support p-matching. Experimentally, we show that p-compression is competitive with various other standard compression schemes. Subsequently, we provide the compression and decompression algorithms. Using p-compression, we address the compressed p-matching problem. Our general solution is independent of the underlying compression scheme. The results are further examined for the specific case of Tunstall codes.
Keywords :
data compression; image coding; image matching; image sequences; Tunstall code; compressed p-matching problem; compressed parameterized pattern matching problem; constant alphabet Σ; decompression algorithm; losslessly data compression; p-string; parameter alphabet Π; standard compression scheme; uncompressed parameterized string; uncompressed sequence; Arrays; Biological information theory; Dictionaries; Encoding; Pattern matching; Plagiarism; Standards; Tunstall codes; compressed pattern matching; lossless compression; p-border; p-match; p-string; parameterized border; parameterized matching; parameterized string;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference (DCC), 2013
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
978-1-4673-6037-1
Type :
conf
DOI :
10.1109/DCC.2013.54
Filename :
6543082
Link To Document :
بازگشت