• DocumentCode
    48631
  • Title

    Learning Phrase Patterns for Text Classification

  • Author

    Bin Zhang ; Marin, A. ; Hutchinson, Brian ; Ostendorf, Mari

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
  • Volume
    21
  • Issue
    6
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    1180
  • Lastpage
    1189
  • Abstract
    This paper introduces methods to discriminatively learn phrase patterns for use as features in text classification. An efficient solution is described using a recursive algorithm with a mutual information selection criterion. The algorithm automatically determines when word classes are useful in specific locations of a phrase pattern, allowing for variable specificity depending on the amount of labeled data available. Experiments are carried out on three text classification tasks in both English and Chinese, resulting in improved performance when adding the phrase patterns to the existing n-gram features.
  • Keywords
    feature extraction; text detection; feature extractor; learning phrase pattern; mutual information selection criterion; recursive algorithm; text classification; text detection; Abstracts; Context; Feature extraction; Materials; Mutual information; Natural language processing; Pattern matching; Mutual information; natural language processing; phrase pattern; text classification;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2245651
  • Filename
    6457440