DocumentCode :
2000437
Title :
Chinese Named Entity Recognition with CRFs: Two Levels
Author :
Hu, Hongping ; Zhang, Hui
Author_Institution :
State Key Lab. of Software Dev. Environ., China
Volume :
2
fYear :
2008
fDate :
13-17 Dec. 2008
Firstpage :
1
Lastpage :
6
Abstract :
Named entity recognition (NER) is one of the key techniques in natural language processing tasks such as information extraction, text summarization and so on. Chinese NER is more complicated and difficult than other languages because of its characteristics. This paper investigates Chinese named entity recognition based on CRFs, and implements three main named entities, person, location, and organization recognition in two levels: word level and character level. Experiments are made to compare the two level models¿ performances. In the experiments, different training scales and feature sets are utilized to look into the models¿ relationships with training corpus and their ability in making use of different features.
Keywords :
character recognition; natural language processing; probability; random processes; text analysis; CRF; Chinese named entity recognition; character level recognition; conditional random field; information extraction; natural language processing; probability; text summarization; word level recognition; Character recognition; Computational intelligence; Data mining; Hidden Markov models; Labeling; Natural language processing; Natural languages; Probability; Programming; Text recognition; CRFs; Chinese Named Entity Recognition; NER;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Security, 2008. CIS '08. International Conference on
Conference_Location :
Suzhou
Print_ISBN :
978-0-7695-3508-1
Type :
conf
DOI :
10.1109/CIS.2008.72
Filename :
4724724
Link To Document :
بازگشت