DocumentCode :
2224669
Title :
Person name identification in Chinese documents using finite state automata
Author :
Shen, Bing ; Zhongfei ; Yuan, Chunfa
Author_Institution :
Comput. Sci. Dept., Binghamton Univ., NY, USA
fYear :
2003
fDate :
13-16 Oct. 2003
Firstpage :
478
Lastpage :
481
Abstract :
This research addresses the automatic identification and extraction of person names in Chinese text documents. Solutions to this problem have immediate and extensive applications in many areas, especially in applications related to Web intelligent agents, such as Web search engines, Web data mining, and automatic Web information analysis. While techniques based on finite state automata (FSA) have been used extensively in natural language processing (NLP) and information extraction (IE) for English, they have not yet been widely used in processing Chinese text; in particular, to our knowledge, no work has been reported on using FSA for person name identification and extraction. Motivated by this need, we propose an FSA-based person name identification method called NICF. Evaluations show that NICF works very well in terms of identification recall and accuracy, as well as processing speed, and thus holds great promise for future applications.
Keywords :
Web sites; automata theory; data mining; finite state machines; search engines; text analysis; Chinese document; Chinese text document; FSA; IE; NICF; NLP; Web information analysis; Web intelligent agents; Web search engine; automatic extraction; automatic identification; finite state automata; person name identification; Application software; Automata; Computer science; Data mining; Information analysis; Intelligent agent; Internet; Robustness; Search engines; Web search
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on
Print_ISBN :
0-7695-1931-8
Type :
conf
DOI :
10.1109/IAT.2003.1241125
Filename :
1241125