• DocumentCode
    140787
  • Title

    Head, modifier, and constraint detection in short texts

  • Author

    Zhongyuan Wang ; Haixun Wang ; Zhirui Hu

  • Author_Institution
    Sch. of Inf., Renmin Univ. of China, Beijing, China
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    280
  • Lastpage
    291
  • Abstract
    Head and modifier detection is an important problem for applications that handle short texts such as search queries, ads keywords, titles, captions, etc. In many cases, short texts such as search queries do not follow grammar rules, and existing approaches for head and modifier detection are coarse-grained, domain specific, and/or require labeling of large amounts of training data. In this paper, we introduce a semantic approach for head and modifier detection. We first obtain a large number of instance level head-modifier pairs from search log. Then, we develop a conceptualization mechanism to generalize the instance level pairs to concept level. Finally, we derive weighted concept patterns that are concise, accurate, and have strong generalization power in head and modifier detection. Furthermore, we identify a subset of modifiers that we call constraints. Constraints are usually specific and not negligible as far as the intent of the short text is concerned, while non-constraint modifiers are more subjective. The mechanism we developed has been used in production for search relevance and ads matching. We use extensive experiment results to demonstrate the effectiveness of our approach.
  • Keywords
    data mining; generalisation (artificial intelligence); query processing; text analysis; ads matching; conceptualization mechanism; constraint detection; generalization power; grammar rules; head detection; instance level head-modifier pairs; modifier detection; search log; search queries; search relevance; semantic approach; short text handling; weighted concept patterns; Companies; Educational institutions; Grammar; Head; Magnetic heads; Pattern matching; Taxonomy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816658
  • Filename
    6816658