• DocumentCode
    3730560
  • Title

    Lao Named Entity Recognition based on conditional random fields with simple heuristic information

  • Author

    Mengjie Yang; Lanjiang Zhou; Zhengtao Yu; Shengxiang Gao; Jianyi Guo

  • Author_Institution
    Sch. of Inf. Eng. &
  • fYear
    2015
  • Firstpage
    1426
  • Lastpage
    1431
  • Abstract
    According to characteristics of Lao named entities, the paper proposes an approach of Lao Named Entity Recognition (NER) based on Conditional Random Fields (CRFs) with knowledge information. Firstly, we segment the text into word sequence and design three labels BIO1 for personal name and location name entity recognition. Secondly, some named entity features of Lao Language are selected for Conditional Random Fields (CRFs) model, such as the clue word feature, the predicate feature etc.. Then, candidate named entities are recognized. Thirdly, we extract simple personal name and location name features of Lao Language to build heuristic information, and use the heuristic information to determine candidate named entities. Finally, named entities which have not been discovered by Conditional Random Fields (CRFs) model are further recognized by using the named entities word list, and these final named entities are obtained. The experimental results show that the method proposed is effective, and it can improve the effect of named entity recognition by using machine learning method with heuristic information.
  • Keywords
    "Speech recognition","Character recognition","Speech","Context","Yttrium","Feature extraction","Learning systems"
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
  • Type

    conf

  • DOI
    10.1109/FSKD.2015.7382153
  • Filename
    7382153