• DocumentCode
    2506206
  • Title

    Enhanced data extraction, transforming and loading processing for Traditional Chinese Medicine clinical data warehouse

  • Author

    Pan, Xishui ; Zhou, Xuezhong ; Song, Hongmei ; Zhang, Runshun ; Zhang, Tingting

  • Author_Institution
    Sch. of Comput. Sci. & Inf. Technol., Beijing Jiaotong Univ., Beijing, China
  • fYear
    2012
  • fDate
    10-13 Oct. 2012
  • Firstpage
    57
  • Lastpage
    61
  • Abstract
    Clinical data warehouse has been developed as a fundamental data infrastructure for large scale TCM clinical data management and decision support services. However, as a key component, data extraction, transforming and loading (ETL) is a complicated and labor intensive task to ensure high data quality before all kinds of data analyses. This paper introduces an enhanced ETL technique framework, which includes operational data store (ODS) model and two step data preprocessing subcomponents, to perform the ETL tasks. The ODS data model was designed to integrate the heterogeneous clinical data sources and support the direct copy from these data sources to ODS database by ETL. Therefore, ETL task has been separated into two core steps in enhanced ETL component: (1) dynamic filter and copy of the original operational data sources to ODS; (2) specialized transforming the ODS data to detailed clinical data warehouse. This enhanced technique framework improves the ETL performance to be used in clinical data center since there would have various kinds of operational data sources that need be integrated in this data environments. This paper has a description of the related enhanced ETL framework and proposes some key procedures to accomplish the tasks.
  • Keywords
    data handling; data warehouses; decision support systems; information retrieval; medical information systems; ETL tasks; ETL technique framework; ODS data model; TCM clinical data management; TCM clinical data warehouse; TCM decision support services; data copying; data extraction; data infrastructure; data loading processing; data transformation; dynamic data filtering; heterogeneous clinical data sources; operational data sources; operational data store model; traditional Chinese medicine; two step data preprocessing subcomponents; Data analysis; Data mining; Data models; Data warehouses; Databases; Hospitals; Medical diagnostic imaging; clinical data warehouse; detailed data warehouse; extraction-transforming-loading; operational data store;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Health Networking, Applications and Services (Healthcom), 2012 IEEE 14th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4577-2039-0
  • Electronic_ISBN
    978-1-4577-2038-3
  • Type

    conf

  • DOI
    10.1109/HealthCom.2012.6380066
  • Filename
    6380066