Title :
Named entity recognition in Vietnamese text using label propagation
Author :
Huong Thanh Le ; Sam, Rathany Chan ; Hoan Cong Nguyen ; Thuy Thanh Nguyen
Author_Institution :
Hanoi Univ. of Sci. & Technol., Hanoi, Vietnam
Abstract :
This paper presents a named entity recognition system for Vietnamese text based on label propagation. We propose: (i) a method for choosing noun phrases as named entity candidates; (ii) a method for measuring word similarity; and (iii) a method for reducing the effect of high-frequency labels in labeled documents. Experimental results show that our label propagation method achieves higher accuracy than the earlier method [12]. In addition, when the amount of labeled data is small, its accuracy is higher than that of conditional random fields.
Keywords :
learning (artificial intelligence); natural language processing; text analysis; Vietnamese text; label propagation; labeled documents; named entity candidates; named entity recognition; noun phrases; word similarity measurement; Accuracy; Context; Frequency measurement; Semantics; Semisupervised learning; Supervised learning; Text recognition; Named entity recognition; labeled propagation; semi-supervised learning; words similarity;
Conference_Title :
Soft Computing and Pattern Recognition (SoCPaR), 2013 International Conference of
Conference_Location :
Hanoi
Print_ISBN :
978-1-4799-3399-0
DOI :
10.1109/SOCPAR.2013.7054160