DocumentCode :
3144514
Title :
Effects of Suffix Repetition Rates of a String on the Performance of String Matching Algorithms
Author :
Wang, Yang
Author_Institution :
Comput. Sci. Dept., Missouri State Univ., Springfield, MO, USA
fYear :
2009
fDate :
1-3 June 2009
Firstpage :
53
Lastpage :
58
Abstract :
The highly efficient Boyer-Moore string matching algorithm uses information about multiple occurrences of suffixes within a pattern string to avoid backtracking while searching for the pattern. One hypothesis is that the Boyer-Moore algorithm benefits even more from highly self-repetitive patterns. In this paper, the author studies how multiple occurrences of string suffixes affect the performance of the Boyer-Moore algorithm as well as some other well-known string search algorithms. The paper introduces a new concept, the suffix repetition rate (SRR), to measure how frequently the suffixes of a string occur inside the string. Using this measure, experiments with several thousand patterns over the entire range of SRRs were carried out, and the results show that increasing the SRR of a pattern string does not improve the efficiency of a search algorithm.
Keywords :
search problems; string matching; Boyer-Moore's string matching algorithm; pattern string; string search algorithm; suffix repetition rate effect; Application software; Computer science; DNA; Data compression; High performance computing; Information science; Information security; Keyword search; Pattern matching; Sequences; String matching; algorithm efficiency; string suffix; suffix repetition rates
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3641-5
Type :
conf
DOI :
10.1109/ICIS.2009.29
Filename :
5223132