DocumentCode :
2069534
Title :
An XML application for genomic data interoperation
Author :
Cheung, Kei-Hoi ; Liu, Yang ; Kumar, Anuj ; Snyder, Michael ; Gerstein, Mark ; Miller, Perry
Author_Institution :
Center for Med. Informatics, Yale Univ., New Haven, CT, USA
fYear :
2001
fDate :
4-6 Nov 2001
Firstpage :
97
Lastpage :
103
Abstract :
As the eXtensible Markup Language (XML) becomes a popular or standard language for exchanging data over the Internet/Web, a growing number of genome Web sites make their data available in XML format. Publishing genomic data in XML format alone would not be that useful without software applications that take advantage of XML technology to process these XML-formatted data. This paper illustrates the usefulness of XML in representing and interoperating genomic data between two different data sources (Snyder's laboratory at Yale and SGD at Stanford). In particular, we compare the locations of transposon insertions in the yeast DNA sequences that have been identified by BLAST searches with the chromosomal locations of the yeast open reading frames (ORFs) stored in SGD. Such a comparison allows us to characterize the transposon insertions by indicating whether they fall into any ORFs (which may potentially encode proteins that possess essential biological functions). To implement this XML-based interoperation, we used NCBI's "blastall" (which gives an XML output option) and SGD's yeast nucleotide sequence dataset to establish a local BLAST server. Also, we converted SGD's ORF location data file (which is available in tab-delimited format) into an XML document based on the BIOML (BIOpolymer Markup Language) standard.
Keywords :
DNA; biology computing; genetics; hypermedia markup languages; information resources; sequences; BIOML standard; BLAST searches; Internet; XML application; chromosomal locations; data exchange; genome Web sites; genomic data interoperation; open reading frames; protein encoding; transposon insertions; yeast DNA sequences; Application software; Bioinformatics; DNA; Fungi; Genomics; Internet; Laboratories; Publishing; Sequences; XML;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Bioinformatics and Bioengineering Conference, 2001. Proceedings of the IEEE 2nd International Symposium on
Conference_Location :
Bethesda, MD
Print_ISBN :
0-7695-1423-5
Type :
conf
DOI :
10.1109/BIBE.2001.974417
Filename :
974417