DocumentCode
3423323
Title
Exploiting agent and database technologies for biological data collection
Author
Davulcu, Hasan ; Lacroix, Zoe ; Parekh, Kaushal ; Ramakrishnan, I.V. ; Julasana, Nikeeta
Author_Institution
Arizona State Univ., AZ, USA
fYear
2004
fDate
30 Aug.-3 Sept. 2004
Firstpage
376
Lastpage
381
Abstract
Web data sources constitute an important resource for biological research. A simple tool that can retrieve information from different Web sites through a single interface and store the extracted data in a standardized format for efficient future use is critical to scientific discovery. We discuss an approach that combines agent and database technologies for biological data integration. To illustrate this, we employ two software tools: WinAgent, for building agents, and dbXML, for XML data management. WinAgent learns from its users by recording a browsing session on Web sites and successive data extraction from regions of interest on retrieved Web pages. The results are stored in a XML document and can be managed, queried and updated using a native XML database system such as dbXML. This approach is currently being evaluated at the Brain Tumor Cancer Unit of the Translational Genomics Research Institute (TGen), Phoenix, Arizona.
Keywords
Web sites; XML; biology computing; query processing; scientific information systems; software agents; Web data sources; Web sites; XML; biological data integration; biology computing; data extraction; database technology; query processing; scientific information systems; software agents; Bioinformatics; Cancer; Data mining; Database systems; Genomics; Information retrieval; Neoplasms; Software tools; Web pages; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2004. Proceedings. 15th International Workshop on
ISSN
1529-4188
Print_ISBN
0-7695-2195-9
Type
conf
DOI
10.1109/DEXA.2004.1333503
Filename
1333503
Link To Document