• DocumentCode
    2359707
  • Title
    A method of correcting provisional boundaries of “bunsetsu”
  • Author
    Arski, T. ; Ikehara, Satoru ; Tuchinashi, J.
  • Author_Institution
    Fac. of Eng., Fukui Univ., Japan
  • fYear
    1994
  • fDate
    18-20 Jul 1994
  • Firstpage
    289
  • Lastpage
    293
  • Abstract
    Translating non-segmented “kana” sentences into “kanji-kana” sentences requires an amount of computer memory that grows rapidly as the sentence length increases. To address this problem, a new method of finding provisional boundaries of “bunsetsu” using 2nd-order Markov chain probability has been developed. This paper proposes a method of correcting the provisional “bunsetsu” boundaries of non-segmented “kana” sentences by looking up all the word candidates in the dictionary. The improvements in the “relevance factor” P and the “recall factor” R of the provisional “bunsetsu” boundaries determined and corrected by these methods were evaluated experimentally using statistical data from 70 issues of a daily Japanese newspaper.
  • Keywords
    Markov processes; language translation; natural languages; statistical analysis; Japanese; Markov chain probability; bunsetsu; kanji-kana sentences; nonsegmented kana sentences; provisional boundary correction; recall factor; relevance factor; translation; Communication networks; Computer networks; Dictionaries; Humans; Information analysis; Information systems; Intelligent networks; Intelligent robots; Intelligent structures; Laboratories
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Robot and Human Communication, 1994. RO-MAN '94 Nagoya, Proceedings., 3rd IEEE International Workshop on
  • Conference_Location
    Nagoya
  • Print_ISBN
    0-7803-2002-6
  • Type
    conf
  • DOI
    10.1109/ROMAN.1994.365915
  • Filename
    365915