DocumentCode
2506206
Title
Enhanced data extraction, transforming and loading processing for Traditional Chinese Medicine clinical data warehouse
Author
Pan, Xishui ; Zhou, Xuezhong ; Song, Hongmei ; Zhang, Runshun ; Zhang, Tingting
Author_Institution
Sch. of Comput. Sci. & Inf. Technol., Beijing Jiaotong Univ., Beijing, China
fYear
2012
fDate
10-13 Oct. 2012
Firstpage
57
Lastpage
61
Abstract
Clinical data warehouse has been developed as a fundamental data infrastructure for large scale TCM clinical data management and decision support services. However, as a key component, data extraction, transforming and loading (ETL) is a complicated and labor intensive task to ensure high data quality before all kinds of data analyses. This paper introduces an enhanced ETL technique framework, which includes operational data store (ODS) model and two step data preprocessing subcomponents, to perform the ETL tasks. The ODS data model was designed to integrate the heterogeneous clinical data sources and support the direct copy from these data sources to ODS database by ETL. Therefore, ETL task has been separated into two core steps in enhanced ETL component: (1) dynamic filter and copy of the original operational data sources to ODS; (2) specialized transforming the ODS data to detailed clinical data warehouse. This enhanced technique framework improves the ETL performance to be used in clinical data center since there would have various kinds of operational data sources that need be integrated in this data environments. This paper has a description of the related enhanced ETL framework and proposes some key procedures to accomplish the tasks.
Keywords
data handling; data warehouses; decision support systems; information retrieval; medical information systems; ETL tasks; ETL technique framework; ODS data model; TCM clinical data management; TCM clinical data warehouse; TCM decision support services; data copying; data extraction; data infrastructure; data loading processing; data transformation; dynamic data filtering; heterogeneous clinical data sources; operational data sources; operational data store model; traditional Chinese medicine; two step data preprocessing subcomponents; Data analysis; Data mining; Data models; Data warehouses; Databases; Hospitals; Medical diagnostic imaging; clinical data warehouse; detailed data warehouse; extraction-transforming-loading; operational data store;
fLanguage
English
Publisher
ieee
Conference_Titel
e-Health Networking, Applications and Services (Healthcom), 2012 IEEE 14th International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4577-2039-0
Electronic_ISBN
978-1-4577-2038-3
Type
conf
DOI
10.1109/HealthCom.2012.6380066
Filename
6380066
Link To Document