Title :
Based cascaded conditional random fields model for Chinese Named Entity recognition
Author_Institution :
Dept. of Electron. & Commun. Eng., North China Electr. Power Univ., Baoding
Abstract :
This paper presents a new approach of Chinese named entity recognition based on cascaded conditional random fields. In the proposed approach, The model structure has been designed with the cascade way, the result then is passed to the high model and suppose the decision of high model for recognition of the complicated organization names. Person and location were recognized using firstly rule-based and lastly statistical-based, which is different from the previous BIO label recognition approach. But, the organization recognition is recognized using firstly statistical-based and lastly rule-based. Some interesting features have been proposed, the new probabilistic feature is proposed, which are used instead of binary feature functions, however, it is one of the several differences between this model and the most of the previous CRFs-based model. We also explore several new features in our model, which includes confidence functions, position of features etc. We evaluate our approach on large-scale corpus with open test method using Peoplepsilas Daily (January, 1998), The evaluation results show that our approach based on cascaded conditional random fields significantly outperforms previous approaches.
Keywords :
knowledge based systems; natural language processing; probability; random processes; statistical analysis; BIO label recognition; CRFs-based model; Chinese named entity recognition; binary feature functions; cascaded conditional random fields model; large-scale corpus; open test method; organization recognition; probabilistic feature; rule-based recognition; statistical-based recognition; Algorithm design and analysis; Character recognition; Context modeling; Hidden Markov models; Information processing; Large-scale systems; Machine learning; Power engineering and energy; Statistical analysis; Testing;
Conference_Titel :
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2178-7
Electronic_ISBN :
978-1-4244-2179-4
DOI :
10.1109/ICOSP.2008.4697435