• DocumentCode
    257532
  • Title

    An integrated method for hierarchy construction of domain-specific terms

  • Author

    Yin Kang ; Lina Zhou ; Dongsong Zhang

  • Author_Institution
    Dept. of Inf. Syst., Univ. of Maryland, Baltimore, MD, USA
  • fYear
    2014
  • fDate
    4-6 June 2014
  • Firstpage
    485
  • Lastpage
    490
  • Abstract
    Understanding text requires not only the extraction of individual concepts, but the identification of semantic relationships among concepts as well. Lexical resources have been applied to analyzing text in a wide range of applications. However, manual compilation of lexical resources is difficult to keep up with the rapid increase of the volume and diversity of user-generated content on the web. Automatic concept hierarchy construction has been considered as one solution to the above problem. Despite extensive effort on automatic construction of concept hierarchies, few studies have focused on the concepts of specific domains. In this study, we propose a comprehensive framework for building a domain-specific concept hierarchy. By synthesizing different types of measurements of relatedness among concepts, we propose an integrated method for building a multi-branch hierarchy of product features from online consumer reviews. The experiment results show that the proposed algorithm successfully reconstructs almost an entire hierarchy except for missing a few concepts and links. Starting from scratch, the algorithm reconstructed about 60% of the manually constructed hierarchy. The proposed method can be used to improve search results by better understanding user queries, and to facilitate personalized recommendations in e-commerce.
  • Keywords
    semantic Web; text analysis; vocabulary; Web; automatic concept hierarchy construction; domain-specific concept hierarchy; domain-specific term hierarchy construction; e-commerce; individual concept extraction; lexical resources; multibranch product feature hierarchy; online consumer reviews; semantic relationship identification; text analysis; text understanding; user queries; user-generated content; Buildings; Cameras; Feature extraction; Gold; Manuals; Semantics; Standards; Hierarchy building; domain-specific terms; mutli-branch clustering; term relatedness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Science (ICIS), 2014 IEEE/ACIS 13th International Conference on
  • Conference_Location
    Taiyuan
  • Type

    conf

  • DOI
    10.1109/ICIS.2014.6912181
  • Filename
    6912181