Title :
Bigram Chinese Word Segmentation by Viterbi Algorithm
Author :
Liu, Dan ; Fang, Weiguo ; Zhou, Hong ; Li, Yan
Author_Institution :
Sch. of Econ. & Manage., Beihang Univ., Beijing, China
Abstract :
Chinese word segmentation is an important foundation for Chinese information processing. This paper proposes a new Chinese word segmentation model based on Bayesian network. In this model, Character alignment Viterbi algorithm, which treats the preceding word of each Chinese character as its state, and the N-gram probability as its state transition probability, is suggested to be combined with Viterbi algorithm to achieve better performance. The model we proposed also achieves word sense disambiguation and auto recognition of foreign and domestic person names together. It is demonstrated to be more efficient in word segmentation under better precision and recall.
Keywords :
Viterbi detection; belief networks; character recognition; natural language processing; probability; word processing; Bayesian network; Chinese character; Chinese information processing; Viterbi algorithm; bigram Chinese word segmentation; character alignment Viterbi algorithm; domestic person name; foreign person name; n-gram probability; state transition probability; word recognition; Bayesian methods; Conference management; Fuzzy systems; Hidden Markov models; Information processing; Knowledge management; Predictive models; Probability; Testing; Viterbi algorithm; Bayesian Network; N-gram; Viterbi Algorithm;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.590