DocumentCode :
3489743
Title :
Named Entity Recognition for Vietnamese Documents Using Semi-supervised Learning Method of CRFs with Generalized Expectation Criteria
Author :
Thi-Ngan Pham ; Le Minh Nguyen ; Quang-Thuy Ha
Author_Institution :
Vietnamese People´´s Police Acad., Hanoi, Vietnam
fYear :
2012
fDate :
13-15 Nov. 2012
Firstpage :
85
Lastpage :
88
Abstract :
Named Entity Recognition (NER) is an important, useful task in many natural language processing applications and much previous work in NER has been done in many other languages such as English, Japanese, Chinese However, Vietnamese NER task is still relatively new and challenge due to the characteristics of Vietnamese, the lack of a large annotated corpus This paper presents a new approach for Vietnamese NER -- a semi-supervised training method for Conditional random fields (CRFs) models using generalized expectation criteria to express a preference for parameter settings. We perform several experiments using different feature setting and different training data to show the high performance of this method and compare to the other method.
Keywords :
document handling; learning (artificial intelligence); natural language processing; random processes; CRF model; Vietnamese NER task; Vietnamese documents; conditional random fields; feature setting; generalized expectation criteria; named entity recognition; natural language processing; semisupervised learning method; semisupervised training method; training data; Data models; Hidden Markov models; Labeling; Semisupervised learning; Testing; Training; Training data; CRFs; Generalized Expectation criteria; semi-supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
Type :
conf
DOI :
10.1109/IALP.2012.54
Filename :
6473702
Link To Document :
بازگشت