DocumentCode :
2359707
Title :
A method of correcting provisional boundaries of “bunsetsu”
Author :
Arski, T. ; IKEHARA, Satoru ; Tuchinashi, J.
Author_Institution :
Fac. of Eng., Fukui Univ., Japan
fYear :
1994
fDate :
18-20 Jul 1994
Firstpage :
289
Lastpage :
293
Abstract :
In order to solve the problem that the amount of computer memories required for translating the non-segmented “kana” sentences into the “kanji-kana” sentences grows rapidly in proportion to the increasing of the length of the sentence, a new method of finding provisional boundaries of “bunsetsu” using 2nd-order Markov chain probability has been developed. This paper proposes a method to correct the provisional boundaries of “bunsetsu” for non-segmented “kana” sentences, by looking up all the word candidates in the dictionary. The improvements of “relevance factor” P and “recall factor” R for provisional boundaries of “bunsetsu” determined and corrected by these methods were evaluated by experiment using the statistical data for 70 issues of a daily Japanese newspaper
Keywords :
Markov processes; language translation; natural languages; statistical analysis; Japanese; Markov chain probability; bunsetsu; kanji-kana sentences; nonsegmented kana sentences; provisional boundary correction; recall factor; relevance factor; translation; Communication networks; Computer networks; Dictionaries; Humans; Information analysis; Information systems; Intelligent networks; Intelligent robots; Intelligent structures; Laboratories;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robot and Human Communication, 1994. RO-MAN '94 Nagoya, Proceedings., 3rd IEEE International Workshop on
Conference_Location :
Nagoya
Print_ISBN :
0-7803-2002-6
Type :
conf
DOI :
10.1109/ROMAN.1994.365915
Filename :
365915
Link To Document :
بازگشت