DocumentCode
579015
Title
Using Papers Citations for Selecting the Best Genomic Databases
Author
Lichtnow, D. ; Alves, Renan ; de Oliveira, J.P.M. ; Levin, A. ; Pastor, O. ; Castello, I.M. ; Dopazo, Joaquin
Author_Institution
Inst. de Inf., Univ. Fed. do Rio Grande do Sul, Porto Alegre, Brazil
fYear
2011
fDate
9-11 Nov. 2011
Firstpage
33
Lastpage
42
Abstract
Selecting the right data is an essential activity in Genomic-related Information Systems. This work aims to analyze if it is possible to select the best genomic databases from a catalog using information about papers citations related to these genomic databases. The motivation for using information about citations has to do with the fact that it is not easy to obtain proper metadata with respect to these databases. Thus, in this work, information related to papers citations is used for measuring three distinct data quality dimensions: believability, timeliness, and relevancy. Believability is evaluated through the inspection of the number of citations. The variation of the number of citations over time is useful for determining the recency of a database and it is related to the timeliness dimension. Regarding to relevancy, the keywords of papers are useful to indicate the main context of application of these databases.
Keywords
biology computing; genomics; information systems; data quality dimensions; genomic databases; genomic-related information systems; paper citations; timeliness dimension; Bioinformatics; Catalogs; Context; Databases; Genomics; Google; Web sites; Database selection; database catalogs; quality indicators;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science Society (SCCC), 2011 30th International Conference of the Chilean
Conference_Location
Curico
ISSN
1522-4902
Print_ISBN
978-1-4673-1364-3
Type
conf
DOI
10.1109/SCCC.2011.6
Filename
6363380
Link To Document