Title :
Automating the biological data collection process with agents
Author :
Lacroix, Zoé ; Parekh, Kaushal ; Davulcu, Hasan ; Ramakrishnan, I.V. ; Julasana, Nikeeta
Author_Institution :
Arizona State Univ., Tempe, AZ, USA
Abstract :
Scientists spend a significant amount of time accessing Web resources, extracting information of interest, and filtering and integrating relevant data from multiple heterogeneous Web sites to support their data collection needs. This tedious collection process is typically performed manually because available technology does not allow scientists to explore and control their data collection process step by step. However, most of the process can be automated. While scripts (e.g., in Perl) may be written to retrieve, parse, and extract data of interest, many scientists are not programmers and do not have IT support. In contrast, we propose an approach based on personal information agents (PIA) that provide scientists with a user-friendly mechanism to automate their data collection processes without any programming. This approach is currently being evaluated at the Brain Tumor Cancer Unit of the Translational Genomics Research Institute (TGen), Phoenix, Arizona.
Keywords :
Web sites; information retrieval; medical computing; Arizona; Brain Tumor Cancer Unit; Phoenix; Translational Genomics Research Institute; biological data collection; data extraction; data parsing; data retrieval; multiple heterogeneous Web sites; personal information agents; user-friendly mechanism; Automatic control; Automatic programming; Cancer; Data mining; Genomics; Information filtering; Information filters; Information retrieval; Neoplasms; Programming profession
Conference_Title :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
DOI :
10.1109/CSB.2004.1332470