Title :
Feasibility of Using Clinical Element Models (CEM) to Standardize Phenotype Variables in the Database of Genotypes and Phenotypes (dbGaP)
Author :
Lin, Ko-Wei ; Tharp, Melissa ; Conway, Mike ; Ross, Mindy ; Hsieh, Alex ; Kim, Hyeon-Eui
Author_Institution :
Div. of Biomed. Inf., Univ. of California San Diego, La Jolla, CA, USA
Abstract :
The database of Genotypes and Phenotypes (dbGaP) contains various types of data generated in Genome Wide Association Studies (GWAS). These data can be used to facilitate novel scientific discovery and to reduce cost and time for exploratory research. However, idiosyncrasies in variable names become a major barrier for reusing these data. We studied the problem of formalizing the phenotype variable descriptions using Clinical Element Models (CEM). Direct mapping of 379 phenotype names to existing CEM yielded a low rate of exact matches (N=25). However, the flexible and expressive underlying information models of CEM provided a robust means of representing 115 phenotype variable descriptions, indicating that CEMs can be successfully applied to standardize a large portion of the clinical variables contained in dbGaP.
Keywords :
biology computing; genomics; standardisation; Genome Wide Association Studies; clinical element model feasibility; data generation; exploratory research; genotypes database; idiosyncrasies; information models; phenotype variable standardisation; phenotypes database; scientific discovery; Conferences; Data models; Databases; Diseases; Educational institutions; Informatics;
Conference_Titel :
Healthcare Informatics, Imaging and Systems Biology (HISB), 2012 IEEE Second International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-4803-4
DOI :
10.1109/HISB.2012.48