• DocumentCode
    253543
  • Title

    Measuring Distance between Unordered Sets of Different Sizes

  • Author

    Gardner, Andrew ; Kanno, Jinko ; Duncan, Christian A. ; Selmic, Rastko

  • Author_Institution
    Louisiana Tech Univ., Ruston, LA, USA
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    137
  • Lastpage
    143
  • Abstract
    We present a distance metric based upon the notion of minimum-cost injective mappings between sets. Our function satisfies metric properties as long as the cost of the minimum mappings is derived from a semimetric, for which the triangle inequality is not necessarily satisfied. We show that the Jaccard distance (alternatively biotope, Tanimoto, or Marczewski-Steinhaus distance) may be considered the special case for finite sets where costs are derived from the discrete metric. Extensions that allow premetrics (not necessarily symmetric), multisets (generalized to include probability distributions), and asymmetric mappings are given that expand the versatility of the metric without sacrificing metric properties. The function has potential applications in pattern recognition, machine learning, and information retrieval.
  • Keywords
    functions; set theory; Jaccard distance; asymmetric mappings; distance measurement; distance metric; information retrieval; machine learning; minimum-cost injective mappings; pattern recognition; Atmospheric measurements; Earth; Extraterrestrial measurements; Indexes; Particle measurements; Vectors; Earth Mover´s Distance; Jaccard Index; Metric;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.25
  • Filename
    6909419