• DocumentCode
    1479032
  • Title

    Nearest-Neighbor Guided Evaluation of Data Reliability and Its Applications

  • Author

    Boongoen, Tossapon ; Shen, Qiang

  • Author_Institution
    Dept. of Comput. Sci., Aberystwyth Univ., Aberystwyth, UK
  • Volume
    40
  • Issue
    6
  • fYear
    2010
  • Firstpage
    1622
  • Lastpage
    1633
  • Abstract
    The intuition of data reliability has recently been incorporated into the main stream of research on ordered weighted averaging (OWA) operators. Instead of relying on human-guided variables, the aggregation behavior is determined in accordance with the underlying characteristics of the data being aggregated. Data-oriented operators such as the dependent OWA (DOWA) utilize centralized data structures to generate reliable weights, however. Despite their simplicity, the approach taken by these operators neglects entirely any local data structure that represents a strong agreement or consensus. To address this issue, the cluster-based OWA (Clus-DOWA) operator has been proposed. It employs a cluster-based reliability measure that is effective to differentiate the accountability of different input arguments. Yet, its actual application is constrained by the high computational requirement. This paper presents a more efficient nearest-neighbor-based reliability assessment for which an expensive clustering process is not required. The proposed measure can be perceived as a stress function, from which the OWA weights and associated decision-support explanations can be generated. To illustrate the potential of this measure, it is applied to both the problem of information aggregation for alias detection and the problem of unsupervised feature selection (in which unreliable features are excluded from an actual learning process). Experimental results demonstrate that these techniques usually outperform their conventional state-of-the-art counterparts.
  • Keywords
    data structures; feature extraction; pattern clustering; reliability theory; alias detection; cluster-based reliability; clustering process; data oriented operator; data reliability; decision support explanation; human guided variable; information aggregation; local data structure; nearest neighbor guided evaluation; ordered weighted averaging operator; unsupervised feature selection; Arithmetic; Computer science; Councils; Data structures; Entropy; Gaussian distribution; Humans; Nearest neighbor searches; Open wireless architecture; Stress measurement; Alias detection; data reliability; nearest neighbor; ordered weighted averaging (OWA) aggregation; unsupervised feature selection; weight determination; Algorithms; Artificial Intelligence; Computer Simulation; Decision Support Techniques; Models, Theoretical; Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2010.2043357
  • Filename
    5454303