• DocumentCode
    3016971
  • Title

    Model evaluation of datasets using critical dimension model invariants

  • Author

    Suryakumar, Divya ; Sung, Andrew H. ; Mazumdar, Subhra ; Liu, Quanwei

  • Author_Institution
    Dept. of Comput. Sci. & Eng., New Mexico Inst. of Min. & Technol., Socorro, TX, USA
  • fYear
    2012
  • fDate
    27-29 Nov. 2012
  • Firstpage
    740
  • Lastpage
    745
  • Abstract
    Critical dimension is the minimum number of features that is required to ensure the performance of a learning machine to be “high”. This critical dimension is usually unique to the learning machine and the ranking algorithm combination. Medical- and bio-informatics datasets are different from most other datasets in that there is an imbalance in most of these datasets and a high prediction accuracy often depends upon not just the overall accuracy but also the true positive and the false negative rates. To find a medically and bio-informatically accurate critical dimension and for better analysis of such datasets we develop two evaluation models, one using all features and the other using critical number of features. The performance measurements such as accuracy, specificity, sensitivity, area under the curve, F-score and kappa values are compared. This paper shows that at the critical dimension the evaluation model shows good results for all performance measurements measured on most datasets studied. The difference in performance measurements obtained using only critical number and using all features is significantly less, i.e., there is not much difference in sensitivity, specificity and other measurements calculated.
  • Keywords
    bioinformatics; data mining; learning (artificial intelligence); bio-informatic datasets; bio-informatically accurate critical dimension; critical dimension model invariants; dataset model evaluation; false negative rates; learning machine performance; medical datasets; medically accurate critical dimension; performance measurements; prediction accuracy; ranking algorithm; true positive rates; Decision support systems; High definition video; Intelligent systems; Critical dimension; feature reduction; sensitivity and specificity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2012 12th International Conference on
  • Conference_Location
    Kochi
  • ISSN
    2164-7143
  • Print_ISBN
    978-1-4673-5117-1
  • Type

    conf

  • DOI
    10.1109/ISDA.2012.6416629
  • Filename
    6416629