DocumentCode :
1855992
Title :
A computational approach for gene assembly and exon annotation based on BLAST
Author :
Deng, Xutao ; Ali, Hesham H.
Author_Institution :
Coll. of Inf. Sci. & Technol., Nebraska Univ., Omaha, NE
fYear :
2005
fDate :
22-25 May 2005
Lastpage :
6
Abstract :
Accurate gene assembly and precise exon annotations are two of the key goals of human genome project. Existing gene reference sequences and exon annotations are far from perfection. This paper introduces a new greedy algorithm which makes use of mRNA reference sequence and BLAST tools from NCBI (National Center for Biotechnology Information) to effectively assemble and annotate gene structures. Four pipelined components are included in this approach. 1. Blast parser: extract mRNA-DNA local alignment pairs. 2. Chain finder: transform local alignments to spliced alignment. 3. Assembler: assemble multiple DNA sequences into a continuous DNA sequence based on their spliced alignments with a given mRNA sequence. 4. Annotator: resolve exon-intron boundary based on splicing signals. Test results on one sample set of human genes show that gene assembly and exon annotation using the proposed approach is significantly better than contig references from NCBI. The software is available upon request
Keywords :
DNA; biology computing; genetics; grammars; greedy algorithms; search problems; sequences; BLAST tools; Blast parser; chain finder; continuous DNA sequence; exon annotation; exon-intron boundary; gene assembly; gene reference sequences; greedy algorithm; human genome project; mRNA reference sequence; mRNA-DNA local alignment pair extraction; multiple DNA sequences; spliced alignment; splicing signals; Assembly; Bioinformatics; Biotechnology; DNA; Data mining; Genomics; Greedy algorithms; Humans; Sequences; Signal resolution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electro Information Technology, 2005 IEEE International Conference on
Conference_Location :
Lincoln, NE
Print_ISBN :
0-7803-9232-9
Type :
conf
DOI :
10.1109/EIT.2005.1627022
Filename :
1627022
Link To Document :
بازگشت