• DocumentCode
    56991
  • Title

    Data Mining in Bone Marrow Transplant Records to Identify Patients With High Odds of Survival

  • Author

    Taati, Babak ; Snoek, Jasper ; Aleman, Dionne ; Ghavamzadeh, Ardeshir

  • Author_Institution
    Toronto Rehabilitation Inst., Univ. Health Network, Toronto, ON, Canada
  • Volume
    18
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    21
  • Lastpage
    27
  • Abstract
    Patients undergoing a bone marrow stem cell transplant (BMT) face various risk factors. Analyzing data from past transplants could enhance the understanding of the factors influencing success. Records up to 120 measurements per transplant procedure from 1751 patients undergoing BMT were collected (Shariati Hospital). Collaborative filtering techniques allowed the processing of highly sparse records with 22.3% missing values. Ten-fold cross-validation was used to evaluate the performance of various classification algorithms trained on predicting the survival status. Modest accuracy levels were obtained in predicting the survival status (AUC = 0.69). More importantly, however, operations that had the highest chances of success were shown to be identifiable with high accuracy, e.g., 92% or 97% when identifying 74 or 31 recipients, respectively. Identifying the patients with the highest chances of survival has direct application in the prioritization of resources and in donor matching. For patients where high-confidence prediction is not achieved, assigning a probability to their survival odds has potential applications in probabilistic decision support systems and in combination with other sources of information.
  • Keywords
    collaborative filtering; data mining; medical expert systems; medical information systems; patient treatment; pattern classification; BMT success; Shariati Hospital; bone marrow stem cell transplant; bone marrow transplant records; classification algorithm training; collaborative filtering techniques; data mining; donor matching; highly sparse data processing; past transplant data; patient risk factors; patient survival rate; probabilistic decision support systems; resource prioritization; survival status prediction; Bayes methods; Collaboration; Matrix decomposition; Optimization; Probabilistic logic; Robustness; Sparse matrices; Bone marrow transplant; collaborative filtering; data mining; donor matching; health records; matrix factorization; principal component analysis; recommender systems; survival;
  • fLanguage
    English
  • Journal_Title
    Biomedical and Health Informatics, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    2168-2194
  • Type

    jour

  • DOI
    10.1109/JBHI.2013.2274733
  • Filename
    6567920