• DocumentCode
    3292502
  • Title

    Combining Metric Features in Large Collections

  • Author

    Batko, Michal ; Kohoutkova, Petra ; Zezula, Pavel

  • Author_Institution
    Masaryk Univ., Brno
  • fYear
    2008
  • fDate
    11-12 April 2008
  • Firstpage
    79
  • Lastpage
    86
  • Abstract
    Current information systems are required to process complex digital objects, which are typically characterized by multiple descriptors. Since the values of many descriptors belong to non-sortable domains, they are effectively comparable only by a sort of similarity. Moreover, the scalability is very important in the current digital-explosion age. Therefore, we propose a distributed extension of the well-known threshold algorithm for peer-to-peer paradigm. The technique allows to answer similarity queries that combine multiple similarity measures and due to its peer-to- peer nature it is highly scalable. We also explore possibilities of approximate evaluation strategies, where some relevant results can be lost in favor of increasing the efficiency by order of magnitude. To reveal the strengths and weaknesses of our approach we have experimented with a 1.6 million image database from Flicker comparing the content of the images by five similarity measures from the MPEG-7 standard. To the best of our knowledge, the experience with such a huge real-life dataset is quite unique.
  • Keywords
    peer-to-peer computing; query processing; visual databases; MPEG-7 standard; complex digital objects; image database; information systems; metric features; peer-to-peer paradigm; similarity queries; Data mining; Image databases; Indexing; Informatics; Information systems; MPEG 7 Standard; Measurement standards; Peer to peer computing; Scalability; XML; multi-metric search; peer-to-peer data network; performance tuning; scalability; similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Similarity Search and Applications, 2008. SISAP 2008. First International Workshop on
  • Conference_Location
    Belfast
  • Print_ISBN
    0-7695-3101-6
  • Type

    conf

  • DOI
    10.1109/SISAP.2008.13
  • Filename
    4492928