DocumentCode
3730560
Title
Lao Named Entity Recognition based on conditional random fields with simple heuristic information
Author
Mengjie Yang; Lanjiang Zhou; Zhengtao Yu; Shengxiang Gao; Jianyi Guo
Author_Institution
Sch. of Inf. Eng. &
fYear
2015
Firstpage
1426
Lastpage
1431
Abstract
According to characteristics of Lao named entities, the paper proposes an approach of Lao Named Entity Recognition (NER) based on Conditional Random Fields (CRFs) with knowledge information. Firstly, we segment the text into word sequence and design three labels BIO1 for personal name and location name entity recognition. Secondly, some named entity features of Lao Language are selected for Conditional Random Fields (CRFs) model, such as the clue word feature, the predicate feature etc.. Then, candidate named entities are recognized. Thirdly, we extract simple personal name and location name features of Lao Language to build heuristic information, and use the heuristic information to determine candidate named entities. Finally, named entities which have not been discovered by Conditional Random Fields (CRFs) model are further recognized by using the named entities word list, and these final named entities are obtained. The experimental results show that the method proposed is effective, and it can improve the effect of named entity recognition by using machine learning method with heuristic information.
Keywords
"Speech recognition","Character recognition","Speech","Context","Yttrium","Feature extraction","Learning systems"
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type
conf
DOI
10.1109/FSKD.2015.7382153
Filename
7382153
Link To Document