• DocumentCode
    3125726
  • Title
    Multi-instance Metric Learning
  • Author
    Xu, Ye; Ping, Wei; Campbell, Andrew T.
  • Author_Institution
    Comput. Sci. Dept., Dartmouth Coll., Hanover, NH, USA
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    874
  • Lastpage
    883
  • Abstract
    Multi-instance learning, like other machine learning and data mining tasks, requires distance metrics. Although metric learning methods have been studied for many years, metric learners for multi-instance learning remain almost untouched. In this paper, we propose a framework called Multi-Instance MEtric Learning (MIMEL) to learn an appropriate distance under the multi-instance setting. The distance metric between two bags is defined using the Mahalanobis distance function. The problem is formulated as minimizing the KL divergence between two multivariate Gaussians under the constraints of maximizing the between-class bag distance and minimizing the within-class bag distance. To exploit how instances determine bag labels in multi-instance learning, we design a nonparametric density-estimation-based weighting scheme that assigns higher weights to the instances that are more likely to be positive in positive bags. The weighting scheme itself is lightweight and adds little extra computational cost to the proposed framework. Moreover, to further boost classification accuracy, a kernel version of MIMEL is presented. We evaluate MIMEL on several typical multi-instance tasks as well as two activity recognition datasets. The experimental results demonstrate that MIMEL achieves better classification accuracy than many state-of-the-art distance-based algorithms and kernel methods for multi-instance learning.
  • Keywords
    Gaussian processes; data mining; learning (artificial intelligence); KL divergence minimization; MIMEL; Mahalanobis distance function; between-class bag distance maximization; classification accuracy; data mining task; distance metrics; machine learning task; multiinstance metric learning; multivariate Gaussians; nonparametric density-estimation-based weighting scheme; within-class bag distance minimization; Data mining; Kernel; Learning systems; Machine learning; Mathematical model; Measurement; Training; Metric learning; Multi-instance learning; Weighting scheme;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type
    conf
  • DOI
    10.1109/ICDM.2011.106
  • Filename
    6137292
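
The two ingredients named in the abstract, a Mahalanobis distance between bags and a KL divergence between two multivariate Gaussians, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: each bag is summarized by a weighted mean of its instances, the weights are supplied by the caller (the density-estimation-based weighting scheme is not reproduced), and the two Gaussians are assumed to share a mean so that the KL divergence depends only on their covariance matrices. The function names `bag_center`, `bag_mahalanobis_sq`, and `gaussian_kl_same_mean` are hypothetical.

```python
import numpy as np


def bag_center(bag, weights=None):
    """Weighted mean of a bag's instances (one instance per row).

    Assumption: a bag is summarized by a single center; uniform weights
    are used when none are given.
    """
    bag = np.asarray(bag, dtype=float)
    if weights is None:
        weights = np.ones(len(bag))
    weights = np.asarray(weights, dtype=float)
    return weights @ bag / weights.sum()


def bag_mahalanobis_sq(bag_a, bag_b, M, w_a=None, w_b=None):
    """Squared Mahalanobis distance between two bag centers.

    M is assumed to be a symmetric positive semidefinite matrix.
    """
    diff = bag_center(bag_a, w_a) - bag_center(bag_b, w_b)
    return float(diff @ M @ diff)


def gaussian_kl_same_mean(cov_p, cov_q):
    """KL(N(mu, cov_p) || N(mu, cov_q)) for two Gaussians with equal means."""
    cov_p = np.asarray(cov_p, dtype=float)
    cov_q = np.asarray(cov_q, dtype=float)
    d = cov_p.shape[0]
    ratio = np.linalg.solve(cov_q, cov_p)  # cov_q^{-1} @ cov_p
    _, logdet = np.linalg.slogdet(ratio)
    return 0.5 * (np.trace(ratio) - logdet - d)
```

With `M` set to the identity matrix, `bag_mahalanobis_sq` reduces to the squared Euclidean distance between bag centers; a learned `M` would stretch or shrink directions in feature space so that bags of different classes move apart and bags of the same class move together.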