DocumentCode
2359707
Title
A method of correcting provisional boundaries of “bunsetsu”
Author
Arski, T. ; IKEHARA, Satoru ; Tuchinashi, J.
Author_Institution
Fac. of Eng., Fukui Univ., Japan
fYear
1994
fDate
18-20 Jul 1994
Firstpage
289
Lastpage
293
Abstract
In order to solve the problem that the amount of computer memories required for translating the non-segmented “kana” sentences into the “kanji-kana” sentences grows rapidly in proportion to the increasing of the length of the sentence, a new method of finding provisional boundaries of “bunsetsu” using 2nd-order Markov chain probability has been developed. This paper proposes a method to correct the provisional boundaries of “bunsetsu” for non-segmented “kana” sentences, by looking up all the word candidates in the dictionary. The improvements of “relevance factor” P and “recall factor” R for provisional boundaries of “bunsetsu” determined and corrected by these methods were evaluated by experiment using the statistical data for 70 issues of a daily Japanese newspaper
Keywords
Markov processes; language translation; natural languages; statistical analysis; Japanese; Markov chain probability; bunsetsu; kanji-kana sentences; nonsegmented kana sentences; provisional boundary correction; recall factor; relevance factor; translation; Communication networks; Computer networks; Dictionaries; Humans; Information analysis; Information systems; Intelligent networks; Intelligent robots; Intelligent structures; Laboratories;
fLanguage
English
Publisher
ieee
Conference_Titel
Robot and Human Communication, 1994. RO-MAN '94 Nagoya, Proceedings., 3rd IEEE International Workshop on
Conference_Location
Nagoya
Print_ISBN
0-7803-2002-6
Type
conf
DOI
10.1109/ROMAN.1994.365915
Filename
365915
Link To Document