Title :
Creating protein sequence patterns using efficient regular expressions in bioinformatics research
Author :
Mulder, Michael ; Nezlek, George S.
Author_Institution :
Grand Valley State Univ., Allendale, MI
Abstract :
Regular expressions are a common feature of contemporary scripting and programming languages, such as Perl and Java, and are also available in conventional database management systems through their incorporation into various dialects of SQL. Bioinformatics researchers encounter problem scenarios in protein sequence analysis that are ideally suited to the application of regular expression algorithms. This paper describes a representative application of regular expressions in this problem domain and demonstrates how the effective use of regular expressions can facilitate more efficient and more effective bioinformatics research.
Keywords :
biology computing; data analysis; pattern matching; proteins; sequences; Java; Perl; SQL; bioinformatics research; database management system; programming language; protein sequence pattern analysis; regular expression algorithm; scripting language; Amino acids; Bioinformatics; Computer languages; Database systems; Hidden Markov models; Management information systems; Protein sequence
Conference_Title :
Information Technology Interfaces, 2006. 28th International Conference on
Conference_Location :
Cavtat/Dubrovnik
Print_ISBN :
953-7138-05-4
DOI :
10.1109/ITI.2006.1708479