Title :
Graph databases for large-scale healthcare systems: A framework for efficient data management and data services
Author :
Yubin Park ; Shankar, M. ; Byung-Hoon Park ; Ghosh, Joydeb
Author_Institution :
Univ. of Texas at Austin, Austin, TX, USA
fDate :
March 31 2014-April 4 2014
Abstract :
Designing a database system for both efficient data management and data services has been one of the enduring challenges in the healthcare domain. In many healthcare systems, data services and data management are often viewed as two orthogonal tasks; data services refer to retrieval and analytic queries such as search, joins, statistical data extraction, and simple data mining algorithms, while data management refers to building error-tolerant and non-redundant database systems. The gap between service and management has resulted in rigid database systems and schemas that do not support effective analytics. We compose a rich graph structure from an abstracted healthcare RDBMS to illustrate how we can fill this gap in practice. We show how a healthcare graph can be automatically constructed from a normalized relational database using the proposed “3NF Equivalent Graph” (3EG) transformation. We discuss a set of real world graph queries such as finding self-referrals, shared providers, and collaborative filtering, and evaluate their performance over a relational database and its 3EG-transformed graph. Experimental results show that the graph representation serves as multiple de-normalized tables, thus reducing complexity in a database and enhancing data accessibility of users. Based on this finding, we propose an ensemble framework of databases for healthcare applications.
Keywords :
collaborative filtering; data handling; graph theory; health care; medical computing; query processing; relational databases; 3EG transformation; 3NF equivalent graph transformation; abstracted healthcare RDBMS; analytic query; collaborative filtering; data management; data mining algorithms; data services; error-tolerant database systems; graph database system; healthcare graph; large-scale healthcare systems; multiple de-normalized tables; nonredundant database systems; normalized relational database; orthogonal tasks; statistical data extraction; Collaboration; Database systems; Medical services; Object oriented modeling; Relational databases; Resource description framework;
Conference_Titel :
Data Engineering Workshops (ICDEW), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
DOI :
10.1109/ICDEW.2014.6818295