• DocumentCode
    2546201
  • Title

    An Efficient Multi-Patterns Parameterized String Matching Algorithm with Super Alphabet

  • Author

    Prasad, Rajesh ; Agarwal, Suneeta

  • Author_Institution
    Dept. of Comput. Sci. & Eng., LDC Inst. of Tech. Studies, Allahabad
  • Volume
    1
  • fYear
    2009
  • fDate
    22-24 Jan. 2009
  • Firstpage
    536
  • Lastpage
    540
  • Abstract
    In parameterized string matching, a pattern P is said to match a substring t of the text T if there exists a bijection from the symbols of P to the symbols of t. This problem has an important application in software maintenance, where it is necessary to find equivalences between two sections of code. Two sections of code are said to be equivalent if one can be transformed into the other by renaming identifiers and variables only. In this paper, we extend the single-pattern exact shift-or string matching algorithm to find all parameterized occurrences of multiple patterns P0, P1, P2, ..., Pr-1 (r ≥ 1), each of equal size m, in the text T. The set of r patterns is handled using the concept of classes of characters. The new algorithm is named the multi-pattern parameterized shift-or (MPSO) string matching algorithm. We further extend MPSO using the concept of super alphabets. Implementation results show that, by using a super alphabet of size s (i.e., s characters are processed simultaneously), MPSO is sped up by a factor of s. With multi-pattern parameterized string matching, the search time is less than that of searching for each pattern individually in the text. We also show the performance of super-alphabet MPSO with respect to the duplicity present in the code. However, these algorithms are applicable only when the pattern length m is less than or equal to the word length w of the computer used (i.e., m ≤ w).
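    The matching criterion in the abstract can be illustrated with a small sketch. It is not the MPSO algorithm itself (no bit-parallelism and no super alphabet); it is a naive reference check built on prev-encoding, which the keywords mention. The sketch assumes that every symbol is a renameable parameter, and the function names `prev_encode` and `p_match_positions` are hypothetical, chosen only for this illustration.

```python
def prev_encode(s):
    """Replace each symbol by the distance to its previous occurrence (0 if none)."""
    last_seen = {}
    encoded = []
    for i, ch in enumerate(s):
        encoded.append(i - last_seen[ch] if ch in last_seen else 0)
        last_seen[ch] = i
    return encoded


def p_match_positions(text, patterns):
    """Return start positions in text where some pattern parameterized-matches.

    Assumption: all symbols are parameters, so two equal-length strings match
    exactly when their prev-encodings are identical (a bijection exists).
    This is a naive O(n * m) check, not the bit-parallel MPSO algorithm.
    """
    m = len(patterns[0])
    if any(len(p) != m for p in patterns):
        raise ValueError("all patterns must have the same length m")
    encoded_patterns = {tuple(prev_encode(p)) for p in patterns}
    return [
        i
        for i in range(len(text) - m + 1)
        if tuple(prev_encode(text[i:i + m])) in encoded_patterns
    ]


if __name__ == "__main__":
    # "xyxy" is a renaming of "abab"; "xyzz" is a renaming of "abcc".
    print(p_match_positions("xyxyzz", ["abab", "abcc"]))  # [0, 2]
```

    Each text window is re-encoded from scratch because a prev-encoding depends on where the window starts; a bit-parallel algorithm such as the one described would avoid this repeated work.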
  • Keywords
    automata theory; string matching; bit-parallelism; code; finite automata; multipattern parameterized shift-or; multiple patterns; software maintenance; string matching algorithm; super alphabet; Application software; Computer science; Pattern matching; Plagiarism; Software maintenance; Algorithm; prev-encoding and parameterized matching; shift-or
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Engineering and Technology, 2009. ICCET '09. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-3334-6
  • Type

    conf

  • DOI
    10.1109/ICCET.2009.199
  • Filename
    4769524