DocumentCode :
2145601
Title :
Pattern Matching with Flexible Wildcards and Recurring Characters
Author :
Wang, Haiping ; Xie, Fei ; Hu, Xuegang ; Li, Peipei ; Wu, Xindong
Author_Institution :
Sch. of Comput. Sci. & Inf. Eng., Hefei Univ. of Technol., Hefei, China
fYear :
2010
fDate :
14-16 Aug. 2010
Firstpage :
782
Lastpage :
786
Abstract :
Pattern matching is an important task, which is widely used in many fields, such as information retrieval and bioinformatics. Recently, a much more flexible pattern matching problem with wildcards has been proposed. Chen et al. introduced local constraints, global constraints and the one-off condition into the task of pattern matching, and the most representative algorithm SAIL was designed. However, the performance of SAIL is not analyzed well, which affects its application. Therefore, this paper analyzes the performance of SAIL in-depth, and discovers that the matching result is closely related to the features of patterns. Meanwhile, the completeness of SAIL in the pattern matching with no-recurring characters is proved, and an improved algorithm, named RSAIL, is proposed for pattern matching with recurring tail characters. Extensive experiments demonstrate that RSAIL improves the number of matches by 2.2% compared to SAIL.
Keywords :
pattern matching; RSAIL; bioinformatics; flexible wildcards; information retrieval; pattern matching; recurring characters; representative algorithm; tail characters; Algorithm design and analysis; Approximation algorithms; Bioinformatics; Classification algorithms; Information retrieval; Pattern matching; USA Councils; SAIL; completeness; matching; wildcard;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-7964-1
Type :
conf
DOI :
10.1109/GrC.2010.156
Filename :
5576076
Link To Document :
بازگشت