Title :
Reverse Backtracking Research of Chinese Segmentation Based on Dictionary of Hash Structure
Author :
Zhen, Liang ; Yu-sheng, Li
Author_Institution :
Comput. & Inf. Eng. Dept., Wuhan Polytech. Univ., Wuhan, China
Abstract :
To improve on the first word dictionary with hash structure and on the reverse maximum matching segmentation algorithm, a last word dictionary based on a hash structure that records word length is designed. Combined with the reverse maximum matching method, it achieves the intended goals of increasing segmentation speed and reducing the ambiguity rate of word segmentation. This paper explains the design principles of the last word dictionary with hash structure and of the reverse backtracking algorithm, and reports their test results.
Keywords :
dictionaries; file organisation; word processing; Chinese segmentation; first word dictionary; hash structure dictionary; last word dictionary; reverse backtracking algorithm; reverse maximum matching methods; word segmentation; Algorithm design and analysis; Computers; Dictionaries; Heuristic algorithms; Indexes; Semantics; Vocabulary; reverse backtracking method; reverse maximum matching method
Conference_Title :
Information Technology and Computer Science (ITCS), 2010 Second International Conference on
Conference_Location :
Kiev
Print_ISBN :
978-1-4244-7293-2
Electronic_ISBN :
978-1-4244-7294-9
DOI :
10.1109/ITCS.2010.71
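Illustrative sketch :
The abstract describes a hash dictionary keyed on the last character of each word that also records word length, used together with reverse maximum matching. The Python sketch below is one possible reading of that idea, not the authors' implementation. The function names (build_last_char_dict, reverse_max_match), the nested-dictionary layout, and the sample vocabulary are all assumptions made for illustration. The paper's backtracking step is not specified in this record, so it is not modelled here.

```python
from collections import defaultdict


def build_last_char_dict(words):
    """Index words by last character, then by word length (hypothetical layout)."""
    index = defaultdict(lambda: defaultdict(set))
    for word in words:
        if word:
            index[word[-1]][len(word)].add(word)
    return index


def reverse_max_match(text, index):
    """Segment text right to left, preferring the longest word ending at each position."""
    tokens = []
    end = len(text)
    while end > 0:
        # Look up only the words that end with the current character.
        by_length = index.get(text[end - 1], {})
        # The recorded lengths tell us which candidate spans are worth testing.
        for length in sorted(by_length, reverse=True):
            start = end - length
            if start >= 0 and text[start:end] in by_length[length]:
                break
        else:
            # No dictionary word ends here: emit a single character.
            start = end - 1
        tokens.append(text[start:end])
        end = start
    tokens.reverse()
    return tokens


# Assumed sample vocabulary, chosen only to demonstrate the lookup.
vocabulary = ["研究", "研究生", "生命", "命", "起源", "的"]
last_char_dict = build_last_char_dict(vocabulary)
print(reverse_max_match("研究生命的起源", last_char_dict))
# ['研究', '生命', '的', '起源']
```

Because the recorded lengths restrict the candidate spans at each position, the matcher tests only substrings that could possibly be dictionary words, which is one plausible source of the speed gain the abstract mentions. In this sample, scanning from the right also avoids the reading "研究生 / 命" that a forward longest match would produce.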