DocumentCode
461678
Title
A Practical Approach to Resolving Combination Ambiguity in Chinese Word Segmentation
Author
Qin, Ying ; Zhang, Suxiang ; Wang, Xiaojie
Author_Institution
Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun.
Volume
3
fYear
2006
fDate
16-20 2006
Abstract
In Chinese word segmentation task, combination ambiguity is one of challenges not being well settled. The main obstacle exists in the detection of ambiguous words in given texts and their proper segmentations. This paper puts forward a practical approach to automatically collecting ambiguous words and disambiguating based on maximum entropy principle. The experimental result reveals the approach of automatic collection ambiguous words can detect combination ambiguity effectively avoiding arduous manual work. As to the disambiguation based on maximum entropy, we investigate new features grounded on prior and contextual knowledge and achieve promising result
Keywords
maximum entropy methods; natural language processing; Chinese word segmentation; combination ambiguity; contextual knowledge; maximum entropy principle; Dictionaries; Entropy; Humans; Power engineering and energy; Power engineering computing; Statistics; Telecommunication computing; Testing; Text processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing, 2006 8th International Conference on
Conference_Location
Beijing
Print_ISBN
0-7803-9736-3
Electronic_ISBN
0-7803-9736-3
Type
conf
DOI
10.1109/ICOSP.2006.345823
Filename
4129212
Link To Document