Title :
Disclosure risk of individuals: A k-anonymity study on health care data related to Indian population
Author :
Panackal, Jisha Jose ; Pillai, Anitha S. ; Krishnachandran, V.N.
Author_Institution :
Dept. of Comput. Applic., Vidya Acad. of Sci. & Technol., Thrissur, India
Abstract :
Many private organizations are reluctant to share health-related information with researchers for fear of compromising data privacy. The non-availability of such data has potentially negative implications for the advancement of medical science, the development of new pharmaceutical products, better diagnosis of disease, and national and micro-level health planning. In this context, the need for reliable and robust anonymization techniques, especially for health care data, has become evident. This paper illustrates the disclosure risk of individuals' health records relating to the Indian population and analyzes the need for suitable mechanisms to protect the privacy of individuals. The data used for our evaluation were made available to us by the nodal agency, the International Institute for Population Sciences (IIPS), Mumbai. They were collected as part of the latest National Family Health Survey, NFHS-3, conducted in the year 2005. A check against the k-anonymity property shows that some individuals are at risk of disclosure if the actual table is linked to other publicly available tables. This paper also illustrates the need for flexible, context-dependent selection of relevant attributes, especially the Quasi-Identifier (QI) attributes, for the widespread acceptance of knowledge-based systems.
Keywords :
data protection; health care; knowledge based systems; medical information systems; risk management; IIPS nodal agency; Indian population; International Institute for Population Sciences; Mumbai; NFHS-3; National Family Health Survey; QI attributes; data anonymization techniques; data collection; data nonavailability; data privacy loss; disclosure risk; disease diagnosis; health care data; health records; health related information sharing; k-anonymity property; knowledge-based systems; microlevel health planning; national health planning; pharmaceutical products; privacy protection; private organizations; publicly available tables; quasiidentifier attributes; Data privacy; Diseases; Education; Joining processes; Privacy; Anonymization; Disclosure risk; Health care; Privacy; k-anonymity;
Conference_Title :
Data Science & Engineering (ICDSE), 2014 International Conference on
Conference_Location :
Kochi
Print_ISBN :
978-1-4799-6870-1
DOI :
10.1109/ICDSE.2014.6974637
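Illustration :
The k-anonymity check mentioned in the abstract can be sketched roughly as follows. This is a minimal sketch, not the authors' implementation: the function names, the column names (age, state, sex, condition) and the four rows are all invented for illustration and are not taken from NFHS-3. A table is k-anonymous with respect to a set of quasi-identifier attributes if every combination of their values is shared by at least k records; records in smaller groups are the ones exposed when the table is linked to another source.

```python
from collections import Counter


def equivalence_class_sizes(records, quasi_identifiers):
    """Count how many records share each combination of quasi-identifier values."""
    return Counter(tuple(r[a] for a in quasi_identifiers) for r in records)


def k_of(records, quasi_identifiers):
    """Return the largest k for which the table is k-anonymous (0 if empty)."""
    sizes = equivalence_class_sizes(records, quasi_identifiers)
    return min(sizes.values()) if sizes else 0


def at_risk(records, quasi_identifiers, k):
    """Return the records whose quasi-identifier group has fewer than k members."""
    sizes = equivalence_class_sizes(records, quasi_identifiers)
    return [
        r for r in records
        if sizes[tuple(r[a] for a in quasi_identifiers)] < k
    ]


# Invented toy rows; the attribute names are hypothetical, not NFHS-3 fields.
records = [
    {"age": 25, "state": "Kerala", "sex": "F", "condition": "anaemia"},
    {"age": 25, "state": "Kerala", "sex": "F", "condition": "none"},
    {"age": 31, "state": "Kerala", "sex": "M", "condition": "diabetes"},
    {"age": 31, "state": "Goa", "sex": "M", "condition": "none"},
]

# The result depends on which attributes are treated as quasi-identifiers.
wide_qi = ["age", "state", "sex"]
narrow_qi = ["age", "sex"]

print(k_of(records, wide_qi))
print(len(at_risk(records, wide_qi, k=2)))
print(k_of(records, narrow_qi))
```

In this toy table the two 31-year-old men become unique once state is included among the quasi-identifiers, while dropping state leaves every group with two members. This mirrors the abstract's point that the choice of QI attributes should be made flexibly, according to context.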