Title of article :
Minimum tag error for discriminative training of conditional random fields
Author/Authors :
Ying-Xiong Qiu، نويسنده , , Jie Zhu، نويسنده , , Hao Huang، نويسنده , , Haihua Xu، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2009
Pages :
11
From page :
169
To page :
179
Abstract :
This paper proposes a new criterion called minimum tag error (MTE) for discriminative training of conditional random fields (CRFs). The new criterion, which is a smoothed approximation to the sentence labeling error, aims to maximize an average of transcription tagging accuracies of all possible sentences, weighted by their probabilities. Corpora from the second international Chinese word segmentation bakeoff (Bakeoff 2005) are used to test the effectiveness of this new training criterion. The experimental results have demonstrated that the proposed minimum tag error criterion can reliably improve the initial performance of supervised conditional random fields. In particular, the recall rate of out-of-vocabulary words (Roov) is significantly improved compared with that obtained using standard conditional random fields. Furthermore, the new training method has the advantage of robustness to segmentation across all datasets.
Keywords :
Machine Learning , Natural language processing , conditional random fields , Chinese word segmentation , discriminative training
Journal title :
Information Sciences
Serial Year :
2009
Journal title :
Information Sciences
Record number :
1212161
Link To Document :
بازگشت