• DocumentCode
    1137022
  • Title

    Multicategory Prediction of Multifactorial Diseases Through Risk Factor Fusion and Rank-Sum Selection

  • Author

    Phegley, James W. ; Perkins, Kyle ; Gupta, Lalit ; Hughes, Larry F.

  • Author_Institution
    Acad. Affairs & Res., Southern Illinois Univ., Carbondale, IL, USA
  • Volume
    35
  • Issue
    5
  • fYear
    2005
  • Firstpage
    718
  • Lastpage
    726
  • Abstract
    A generalized strategy is developed to predict the occurrence of a multicategory–multifactorial disease from a set of medical risk factors that are most often used to screen patients for the disease. The prediction problem is formulated as an M -class classification problem. The strategy employs fusion to combine risk factors into a single feature vector, normalization to fuse risk factors which have different formats and ranges, rank-sum ordering for feature selection, discrete Karhunen–Loeve transform-based transformation to facilitate parametric classifier development, and the design of parametric classifiers. Two methods, which differ on how the features are selected, are developed. In the first method, features are selected from a set consisting of linear combinations of all risk factors. In the second method, the features are linear combinations of a preselected subset of the risk factors. The methods are applied to predict the occurrence of Alzheimer\´s disease (AD) into three classes: Probable-AD, Possible-AD, and Uncertain. It is shown that a classification accuracy of over 71% can be obtained. This result is quite encouraging given that AD is very difficult to clinically diagnose. Higher classification accuracies can be expected for diseases that are not as complex to diagnose as AD. Most importantly, it is concluded that the generalized strategy can not only be applied to the multicategory–multifactorial disease prediction problem but also to other multiclass pattern recognition problems involving diverse information collected from different sources.
  • Keywords
    Karhunen-Loeve transforms; discrete transforms; diseases; patient diagnosis; pattern classification; M-class classification; discrete Karhunen Loeve transform; feature selection; medical risk factor fusion; multicategory prediction; multifactorial disease; pattern recognition; rank sum selection; single feature vector; Alzheimer´s disease; Discrete transforms; Fuses; History; Medical diagnostic imaging; Medical tests; Pattern recognition; Testing; Vectors; Alzheimer´s disease; feature selection; fusion; multiclass classification; rank ordering;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4427
  • Type

    jour

  • DOI
    10.1109/TSMCA.2005.843390
  • Filename
    1495613