• DocumentCode
    1500952
  • Title

    Enriching One Taxonomy Using Another

  • Author

    Subramaniam, L. Venkata ; Nanavati, Amit Anil ; Mukherjea, Sougata

  • Author_Institution
    IBM India Res. Lab., New Delhi, India
  • Volume
    22
  • Issue
    10
  • fYear
    2010
  • Firstpage
    1415
  • Lastpage
    1427
  • Abstract
    Taxonomies, representing hierarchical data, are a key knowledge source in multiple disciplines. Information processing across taxonomies is not possible unless they are appropriately merged for commonalities and differences. For taxonomy merging, the first task is to identify common concepts between the taxonomies. Then, these common concepts along with their associated concepts in the two taxonomies need to be integrated. Doing this in a conflict-free manner is a challenging task and generally requires human intervention. In this paper, we explore the possibility of asymmetrically merging one taxonomy into another automatically. Given one or more source taxonomies and a destination taxonomy, modeled as directed acyclic graphs, we present intuitive algorithms that merge relevant portions of the source taxonomies into the destination taxonomy. We prove that our algorithms are conflict-free, information lossless, and scalable. We also define precision and recall measures for evaluating enriched taxonomies, such as TA, the result of merging two taxonomies, with TI, the ideal merger. Our experiments indicate the effectiveness of our approach.
  • Keywords
    classification; data analysis; directed graphs; merging; directed acyclic graphs; hierarchical data; information processing; taxonomy merging; Biology computing; Corporate acquisitions; Humans; Information processing; Merging; Object oriented modeling; Ontologies; Software algorithms; Taxonomy; Unified modeling language; Taxonomy merging; graph merging algorithms.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2009.189
  • Filename
    5288524