• DocumentCode
    69792
  • Title

    Discover Novel Visual Categories From Dynamic Hierarchies Using Multimodal Attributes

  • Author

    Jianhua Zhang ; Jianwei Zhang ; Shengyong Chen

  • Author_Institution
    Univ. of Hamburg, Hamburg, Germany
  • Volume
    9
  • Issue
    3
  • fYear
    2013
  • fDate
    Aug. 2013
  • Firstpage
    1688
  • Lastpage
    1696
  • Abstract
    Learning novel visual categories from observations and experiences in unexplored environment is a vitally important cognitive ability for human beings. A dynamic category hierarchy that is an inherent structure in a human mind is a key component for this ability. This paper develops a framework to build dynamic category hierarchy based on object attributes and a topic model. Since humans trend to utilize multimodal information to learn novel categories, we also develop an algorithm to learn multimodal object attributes from multimodal data. The new multimodal attributes can describe objects efficiently and can generalize from learned categories to novel ones. By comparison with a state-of-the-art unimodal attribute, the multimodal attributes can achieve 4%-19% improvements on average. We also develop a constrained topic model, which can accurately construct category hierarchies for large-scale categories. Based on them, the novel framework can effectively detect novel categories and relate them with known categories for further category learning. Extensive experiments are conducted using a public multimodal dataset, i.e., color and point cloud data, to evaluate the multimodal attributes and the dynamic category hierarchy. The experimental results show the effectiveness of multimodal attributes to describe objects and the satisfactory performance of the dynamic category hierarchy to discover novel categories. By comparison with state-of-the-art methods, the dynamic category hierarchy achieves 7% improvements.
  • Keywords
    data mining; learning (artificial intelligence); color data; constrained topic model; dynamic category hierarchy; multimodal object attribute; point cloud data; visual category learning; Histograms; Humans; Informatics; Resource management; Semantics; Shape; Visualization; Constrained topic model; RGB-D data; dynamic category hierarchies; multimodal object attributes; multimodal sensor; novel visual category discovery;
  • fLanguage
    English
  • Journal_Title
    Industrial Informatics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1551-3203
  • Type

    jour

  • DOI
    10.1109/TII.2013.2248741
  • Filename
    6470680