• DocumentCode
    1821897
  • Title

    A threshold method for Imbalanced Multiple Noisy Labeling

  • Author

    Jing Zhang ; Xindong Wu ; Sheng, Victor S.

  • Author_Institution
    Dept. of Comput. Sci., Hefei Univ. of Technol., Hefei, China
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    61
  • Lastpage
    65
  • Abstract
    Internet-based crowdsourcing systems can be viewed as a kind of loosely coupled social networks. With these systems, it is easy to collect multiple noisy labels for the same object when conducting annotation for supervised learning. Because non-expert labelers lack expertise and dedication, and have strong personal preference, they may have bias when labeling. These cause Imbalanced Multiple Noisy Labeling. In this paper, we propose an agnostic algorithm Positive LAbel frequency Threshold (PLAT) to deal with imbalanced labeling. Because of the dynamics of social networks, in most cases no information about the qualities of labelers and underlying class distributions can be acquired. PLAT does not require prior knowledge of the labeling qualities of labelers, the underlying class distributions, and the level of labeling imbalance. Simulations on eight real-world datasets with different underlying class distributions demonstrate that PLAT not only effectively deals with the imbalanced multiple noisy labeling that off-the-shelf agnostic methods cannot cope with, but also performs nearly the same as majority voting under the circumstances that labelers have no bias.
  • Keywords
    Internet; learning (artificial intelligence); social networking (online); Internet-based crowdsourcing systems; PLAT; class distributions; imbalanced multiple noisy labeling; positive label frequency threshold method; social network dynamics; supervised learning; Accuracy; Conferences; Data mining; Labeling; Noise measurement; Social network services; Training; classification; crowdsourcing; imbalance labeling; multiple noisy labels; outsourcing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on
  • Conference_Location
    Niagara Falls, ON
  • Type

    conf

  • Filename
    6785688