Title :
An efficient dictionary access method for morphological analysis
Author :
Ando, Kazuaki ; Tsuji, Takako ; Fuketa, Masao ; Aoe, Jun-Ichi
Author_Institution :
Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
Abstract :
Proposes an efficient dictionary access method for morphological analysis of oriental languages by extending Aho and Corasick's (1990) pattern matching machine. The proposed method is a simple and efficient algorithm that finds all possible substrings in an input sentence during a single pass. It stores the grammatical connectivity relations of adjacent words in the output functions, so the cost of checking connections between adjacent words can be reduced by using these relations. Furthermore, the method of constructing the grammatical connectivity relations is described. Finally, the proposed method is verified by theoretical analysis and by an experimental evaluation based on a computer simulation with a 100,000-word dictionary. The simulation results show that the proposed method is 49.9% faster (in CPU time) than the traditional trie approach. In addition, the number of candidates for checking connections was 25.5% lower than that of the original morphological analysis.
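The single-pass dictionary lookup that the abstract builds on can be illustrated with a minimal sketch of a plain Aho-Corasick pattern matching machine. This is not the authors' implementation: the function names `build_ac` and `find_all`, the list-based state tables, and the sample word list are assumptions made for illustration, and the grammatical connectivity relations that the paper stores in the output functions are not modeled here.

```python
from collections import deque


def build_ac(words):
    """Build goto, failure, and output tables for a set of dictionary words."""
    goto, fail, out = [{}], [0], [[]]  # state 0 is the root
    # Phase 1: insert every word into a trie (the goto function).
    for w in words:
        s = 0
        for ch in w:
            if ch not in goto[s]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[s][ch] = len(goto) - 1
            s = goto[s][ch]
        out[s].append(w)
    # Phase 2: compute failure links breadth-first and merge outputs.
    queue = deque(goto[0].values())  # depth-1 states fail to the root
    while queue:
        s = queue.popleft()
        for ch, t in goto[s].items():
            queue.append(t)
            f = fail[s]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[t] = goto[f].get(ch, 0)
            out[t] = out[t] + out[fail[t]]
    return goto, fail, out


def find_all(text, machine):
    """Return (start_index, word) for every dictionary word found in text."""
    goto, fail, out = machine
    s, hits = 0, []
    for i, ch in enumerate(text):
        # Follow failure links until a transition on ch exists (or we hit the root).
        while s and ch not in goto[s]:
            s = fail[s]
        s = goto[s].get(ch, 0)
        for w in out[s]:
            hits.append((i - len(w) + 1, w))
    return hits


machine = build_ac(["he", "she", "his", "hers"])
hits = find_all("ushers", machine)
```

Each input character is read once, and every dictionary word ending at that position is reported from the output table of the current state. A trie-based lookup, by contrast, typically restarts the search from each starting position in the sentence.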
Keywords :
dictionaries; digital simulation; linguistics; natural languages; string matching; CPU time; adjacent words; computer simulation; connection checking costs; dictionary access method; grammatical connectivity; input sentence substrings; morphological analysis; oriental languages; output functions; pattern matching machine; tries; AC machines; Computer simulation; Costs; Dictionaries; Information analysis; Information science; Intelligent systems; Machine intelligence; Natural language processing; Natural languages;
Conference_Title :
Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
0-7803-4778-1
DOI :
10.1109/ICSMC.1998.725099