DocumentCode
2839133
Title
Improving Chinese text Chunkings precision using Transformation-based Learning
Author
Liu, Ying ; Liao, Panpan
Author_Institution
Tsinghua Univ., Beijing
fYear
2006
fDate
15-17 Dec. 2006
Firstpage
2480
Lastpage
2485
Abstract
Based on text chunking using HMM, transformation-based learning is made use of to improve the precision of chunk tags further. The training data and the test data are from Penn treebank 4.0, and 13 text chunks are used. Rules are learned automatically according to the rule templates. The precision is improved 4.48%. The detailed analysis that affects the text chunking is given. Different threshold, different scale of training data, different learning equation and different rule templates can affect the precision of the text chunking.
Keywords
hidden Markov models; learning (artificial intelligence); natural language processing; text analysis; Chinese text chunking; Penn treebank 4.0; hidden Markov model; learning equation; rule templates; transformation-based learning; Entropy; Hidden Markov models; Learning systems; Machine learning; Mathematical model; Mutual information; Support vector machine classification; Support vector machines; Testing; Training data; Text chunking; transformation-based Learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Industrial Technology, 2006. ICIT 2006. IEEE International Conference on
Conference_Location
Mumbai
Print_ISBN
1-4244-0726-5
Electronic_ISBN
1-4244-0726-5
Type
conf
DOI
10.1109/ICIT.2006.372688
Filename
4238010
Link To Document