DocumentCode :
2353372
Title :
An Efficient Bit-Parallel Multi-Patterns Word Searching Algorithm through Splitting the Text
Author :
Yadav, Ishadutta ; Singh, Bharat ; Agarwal, Suneeta ; Prasad, Rajesh
Author_Institution :
Dept. of Comput. Sci. & Eng., Motilal Nehru Nat. Inst. of Technol., Allahabad, India
fYear :
2009
fDate :
27-28 Oct. 2009
Firstpage :
406
Lastpage :
410
Abstract :
Word matching problem is to find all the occurrences of a pattern P[0...m-1] in the text T[0...n-1], where P neither contains any white space nor preceded and followed by space. In the multi-patterns word matching problem, all the occurrences of multiple word P0, P1, P2 ...Pr-1, (rges1) in the given text T are to be reported. In the present discussion, we assume that all the patterns have equal size m and our text T is offline. We further assume that m les w, where w is the word length of computer used. Ibrahiem et al. in 2008 have proposed an algorithm (WSA) for solving the word matching problem for single pattern by splitting the offline text into number of tables in the preprocessing phase. The main drawback of this algorithm was: after splitting the text into a number of tables, they search each occurrence of the pattern by the brute force manner in each table. In this paper, we extend this algorithm for multi-patterns word matching by using the technique of bit-parallel proposed by Baeza Yates, 1992. In this technique, after splitting the text into number of tables, we apply the shift-or algorithm to find the words of same length in the text T. The set of r multiple patterns is being handled by using the concept of classes of characters. This extended algorithm is called as multi-patterns word searching algorithm (MPWSA). Experimental results show that MPWSA algorithm is much faster than the previously proposed WSA algorithm.
Keywords :
string matching; text analysis; word processing; bit-parallel multipatterns word searching algorithm; multipatterns word matching problem; shift-or algorithm; string matching; text splitting; Communications technology; Computer science; Pattern matching; Space technology; White spaces; Algorithm; and word searching; offline searching; shift-or; string matching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on
Conference_Location :
Kottayam, Kerala
Print_ISBN :
978-1-4244-5104-3
Electronic_ISBN :
978-0-7695-3845-7
Type :
conf
DOI :
10.1109/ARTCom.2009.211
Filename :
5329368
Link To Document :
بازگشت