DocumentCode
2069341
Title
PSST... the probabilistic sequence search tool
Author
Miller, Crispin J. ; Attwood, Teresa K.
Author_Institution
Dept. of Comput. Sci., Manchester Univ., UK
fYear
2001
fDate
4-6 Nov 2001
Firstpage
33
Lastpage
40
Abstract
Whole genome comparison and clustering cannot be routinely performed without access to significant resources. If as expected, repositories continue to grow at the current rate, increasingly large and expensive systems will be required in order to maintain the status quo. The high-proportion of uncharacterised gene-sequences, combined with the fact that the majority of sequence analysis techniques are alignment-based, raises the possibility that alternative approaches might be able to identify relationships that have otherwise been missed. There is a need for alternative ways to predict function. PSST is an analysis tool with parallels to both pairwise algorithms and multiple motif-based pattern approaches. It is significantly faster than BLAST, and for some families including GPCRs, the tool is more sensitive and selective as well. For others it is worse. This paper describes the algorithm, its implementation, its evaluation against a diverse set of protein families, and discusses the reasons behind its behaviour
Keywords
biology computing; genetics; sequences; PSST; Probabilistic Sequence Search Tool; gene sequences; multiple motif-based pattern approaches; pairwise algorithms; whole genome clustering; whole genome comparison; Algorithm design and analysis; Biochemistry; Bioinformatics; Birth disorders; Computer science; Databases; Genomics; Hip; Proteins; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering Conference, 2001. Proceedings of the IEEE 2nd International Symposium on
Conference_Location
Bethesda, MD
Print_ISBN
0-7695-1423-5
Type
conf
DOI
10.1109/BIBE.2001.974409
Filename
974409
Link To Document