• DocumentCode
    610320
  • Title

    Attribute extraction and scoring: A probabilistic approach

  • Author

    Taesung Lee ; Zhongyuan Wang ; Haixun Wang ; Seung-won Hwang

  • Author_Institution
    POSTECH, Pohang, South Korea
  • fYear
    2013
  • fDate
    8-12 April 2013
  • Firstpage
    194
  • Lastpage
    205
  • Abstract
    Knowledge bases, which consist of concepts, entities, attributes and relations, are increasingly important in a wide range of applications. We argue that knowledge about attributes (of concepts or entities) plays a critical role in inferencing. In this paper, we propose methods to derive attributes for millions of concepts and we quantify the typicality of the attributes with regard to their corresponding concepts. We employ multiple data sources such as web documents, search logs, and existing knowledge bases, and we derive typicality scores for attributes by aggregating different distributions derived from different sources using different methods. To the best of our knowledge, ours is the first approach to integrate concept- and instance-based patterns into probabilistic typicality scores that scale to broad concept space. We have conducted extensive experiments to show the effectiveness of our approach.
  • Keywords
    inference mechanisms; knowledge based systems; Web documents; attribute extraction; attribute scoring; concept-based patterns; instance-based patterns; knowledge bases; multiple data sources; probabilistic approach; probabilistic typicality scores; search logs; Companies; Data mining; Knowledge based systems; Probabilistic logic; Sociology; Statistics; Syntactics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2013 IEEE 29th International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-4909-3
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2013.6544825
  • Filename
    6544825