Title :
Single pattern search implementations in a cluster computing environment
Author :
Prasad, J.C. ; Panicker, K.S.M.
Author_Institution :
Dept. of Comput. Sci. & Eng., Dr. M.G.R. Univ., Angamaly, India
Abstract :
This paper provides an overview of different string matching algorithms in parallel environments. In this work, we have evaluated several algorithms, such as Knuth-Morris-Pratt, Boyer-Moore algorithm, Boyer Moore Horspool Algorithm, Zhu Takaoka algorithm, Quick Search Algorithm, BR Algorithm, Fast Searching algorithm, SSABS algorithm, TVSBS algorithm, ZTMBH algorithm and BRBMH algorithm. Static pattern of string matching algorithms are widely used for searching. For comparisons of speed different algorithms are implemented with parallel programming technique of MPI Standard, for a large text file with a pattern of small length. A Beowulf based homogenous cluster architecture was developed which has a performance of 81 Gigabits floating point operations per second in searching environment. The result of pattern searching showed the variation of performance with this customized cluster architecture for variable number of compute nodes with different pattern lengths. Nearly 80% of the searching code part works in parallel. The performance of string search algorithm is based on the network bandwidth and the selection of algorithms used. Depending on certain applications, different algorithm can be used. It is observed that the cluster searching architecture provides better speed with different algorithms. In general, from the experiments, BRBMH Algorithm is found to be efficient for a general string searching applications.
Keywords :
message passing; parallel programming; pattern clustering; search problems; string matching; BR algorithm; BRBMH algorithm; Beowulf based homogenous cluster architecture; Boyer Moore Horspool algorithm; Knuth-Morris-Pratt algorithm; MPI standard; SSABS algorithm; TVSBS algorithm; ZTMBH algorithm; Zhu Takaoka algorithm; cluster computing environment; fast searching algorithm; network bandwidth; parallel environment; parallel programming; quick search algorithm; searching code; single pattern search; static pattern; string matching; string search algorithm; Algorithm design and analysis; Clustering algorithms; Complexity theory; Computer architecture; Program processors; Signal processing algorithms; Software algorithms; Cluster Architecture; Message Passing Interface; Parallel Processing; Search Algorithms;
Conference_Titel :
Digital Ecosystems and Technologies (DEST), 2010 4th IEEE International Conference on
Conference_Location :
Dubai
Print_ISBN :
978-1-4244-5551-5
DOI :
10.1109/DEST.2010.5610617