DocumentCode
429470
Title
The BlastNP: a novel, sensitive sequence similarity searching method using overlappingly translated sequences
Author
Biro, J.C. ; Biro, J.M.K.
Author_Institution
Homulus Informatics, San Francisco, CA, USA
Volume
1
fYear
2004
fDate
1-5 Sept. 2004
Firstpage
2777
Lastpage
2780
Abstract
An alternative method to TblastX has been developed, known as blastNP. Nucleic acids in database and query sequences were translated into overlapping protein-like sequences (overlappingly translated sequences or OTSs) before searching with blastP. Thus, each nucleic acid sequence is represented by a single "protein like" sequence (instead of three hypothetical proteins in different reading frames). The BlastNP method is defined as a BlastP that is performed on an overlappingly translated nucleic acid database using a similarly converted nucleic acid query. The specificity and sensitivity of blastNP and TblastX is quantitatively very similar, except that blastNP is more sensitive to detect short sequence similarities (less than 50 residues). However, a qualitative comparison of the observed similarities showed that only 56% was detected by both methods, but 22% was indicated only by blastNP and 22% only by TblastX. For example, a statistically significant similarity between prion protein (PrP) and transcriptions factors (TF) was only detected by blastNP. A signal amplification was seen when OTS sequences were used in similarity visualisation methods (like LALIGN) instead of nucleic acids.
Keywords
biochemistry; biology computing; genetics; macromolecules; molecular biophysics; proteins; query processing; statistical analysis; Basic Local Alignment Search Tool; BlastNP; bioinformatics; codon; nucleic acid query; nucleic acid sequence; nucleotide-nucleotide BLAST; overlappingly translated nucleic acid database; overlappingly translated sequences; prion protein; protein sequence; protein-protein BLAST; query sequences; signal amplification; similarity searching method; statistical analysis; transcriptions factors; translated sequences; Amino acids; Code standards; Databases; Genetics; History; Humans; Informatics; Insulin; Proteins; Sequences; Blast; bioinformatics methods; codon; nucleic acid; overlapping translation; protein;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society, 2004. IEMBS '04. 26th Annual International Conference of the IEEE
Conference_Location
San Francisco, CA
Print_ISBN
0-7803-8439-3
Type
conf
DOI
10.1109/IEMBS.2004.1403794
Filename
1403794
Link To Document