• DocumentCode
    2353372
  • Title

    An Efficient Bit-Parallel Multi-Patterns Word Searching Algorithm through Splitting the Text

  • Author

    Yadav, Ishadutta ; Singh, Bharat ; Agarwal, Suneeta ; Prasad, Rajesh

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Motilal Nehru Nat. Inst. of Technol., Allahabad, India
  • fYear
    2009
  • fDate
    27-28 Oct. 2009
  • Firstpage
    406
  • Lastpage
    410
  • Abstract
    Word matching problem is to find all the occurrences of a pattern P[0...m-1] in the text T[0...n-1], where P neither contains any white space nor preceded and followed by space. In the multi-patterns word matching problem, all the occurrences of multiple word P0, P1, P2 ...Pr-1, (rges1) in the given text T are to be reported. In the present discussion, we assume that all the patterns have equal size m and our text T is offline. We further assume that m les w, where w is the word length of computer used. Ibrahiem et al. in 2008 have proposed an algorithm (WSA) for solving the word matching problem for single pattern by splitting the offline text into number of tables in the preprocessing phase. The main drawback of this algorithm was: after splitting the text into a number of tables, they search each occurrence of the pattern by the brute force manner in each table. In this paper, we extend this algorithm for multi-patterns word matching by using the technique of bit-parallel proposed by Baeza Yates, 1992. In this technique, after splitting the text into number of tables, we apply the shift-or algorithm to find the words of same length in the text T. The set of r multiple patterns is being handled by using the concept of classes of characters. This extended algorithm is called as multi-patterns word searching algorithm (MPWSA). Experimental results show that MPWSA algorithm is much faster than the previously proposed WSA algorithm.
  • Keywords
    string matching; text analysis; word processing; bit-parallel multipatterns word searching algorithm; multipatterns word matching problem; shift-or algorithm; string matching; text splitting; Communications technology; Computer science; Pattern matching; Space technology; White spaces; Algorithm; and word searching; offline searching; shift-or; string matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on
  • Conference_Location
    Kottayam, Kerala
  • Print_ISBN
    978-1-4244-5104-3
  • Electronic_ISBN
    978-0-7695-3845-7
  • Type

    conf

  • DOI
    10.1109/ARTCom.2009.211
  • Filename
    5329368