• DocumentCode
    2839133
  • Title

    Improving Chinese text Chunkings precision using Transformation-based Learning

  • Author

    Liu, Ying ; Liao, Panpan

  • Author_Institution
    Tsinghua Univ., Beijing
  • fYear
    2006
  • fDate
    15-17 Dec. 2006
  • Firstpage
    2480
  • Lastpage
    2485
  • Abstract
    Based on text chunking using HMM, transformation-based learning is made use of to improve the precision of chunk tags further. The training data and the test data are from Penn treebank 4.0, and 13 text chunks are used. Rules are learned automatically according to the rule templates. The precision is improved 4.48%. The detailed analysis that affects the text chunking is given. Different threshold, different scale of training data, different learning equation and different rule templates can affect the precision of the text chunking.
  • Keywords
    hidden Markov models; learning (artificial intelligence); natural language processing; text analysis; Chinese text chunking; Penn treebank 4.0; hidden Markov model; learning equation; rule templates; transformation-based learning; Entropy; Hidden Markov models; Learning systems; Machine learning; Mathematical model; Mutual information; Support vector machine classification; Support vector machines; Testing; Training data; Text chunking; transformation-based Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Technology, 2006. ICIT 2006. IEEE International Conference on
  • Conference_Location
    Mumbai
  • Print_ISBN
    1-4244-0726-5
  • Electronic_ISBN
    1-4244-0726-5
  • Type

    conf

  • DOI
    10.1109/ICIT.2006.372688
  • Filename
    4238010