• DocumentCode
    1788059
  • Title

    Medical data mining: A case study of a Paracoccidioidomycosis patient´s database

  • Author

    Liboredo Ferreira, Eduardo ; Rausch, Herbert ; Campos, Sergio ; Faria-Campos, Alessandra ; Pietra, Enio ; da Silva Santos, Lilian

  • Author_Institution
    Dept. of Comput. Sci., Univ. Fed. de Minas Gerais, Belo Horizonte, Brazil
  • fYear
    2014
  • fDate
    15-18 Oct. 2014
  • Firstpage
    275
  • Lastpage
    280
  • Abstract
    Data mining applied to medical databases is a challenging process. The unavailability of large sources of data and data complexity are some of the difficulties encountered. This is especially true for rare and neglected diseases. Those databases are, in general, relatively small, wide and sparse, making them very challenging to analyze. There are also ethical, legal and social issues regarding privacy and clinical validation of the findings. This work proposes a way of dealing with this challenge with a case study of data mining applied in a Paracoccidioidomycosis (PCM) patients database. Paracoccidioidomycosis (PCM) is a typical Brazilian disease, caused by the yeast Paracoccidioides brasiliensis. This disease represents an important Public Health issue, due to its high incapacitating potential and the amount of premature deaths it causes if untreated. This paper discusses methods for the analysis of this complex dataset, to help increase the understanding of both the disease and this type of data. Despite the challenges of the dataset, some interesting findings were made being: flaws in form filling protocols, notably the lack of chest X-ray in 40% of the records; the discovery of a possible new relation between smoking habits and PCM evolution time. The average evolution time for smoking patients was 2.8 times longer; the successful classification/prediction of the cutaneous form of the disease with a 93% precision rate are some of the discoveries made.
  • Keywords
    data mining; data privacy; diseases; medical information systems; Brazilian disease; PCM evolution time; PCM patient database; Paracoccidioides brasiliensis; Paracoccidioidomycosis patient database; X-ray; clinical validation; data complexity; data privacy; data sources; medical data mining; medical databases; public health issue; smoking; Algorithm design and analysis; Data mining; Databases; Diseases; Lesions; Phase change materials; Skin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Health Networking, Applications and Services (Healthcom), 2014 IEEE 16th International Conference on
  • Conference_Location
    Natal
  • Type

    conf

  • DOI
    10.1109/HealthCom.2014.7001854
  • Filename
    7001854