Title :
New methods for detecting characters wrongly deleted and inserted in Japanese strings and their applicability to DNA chains
Author :
Araki, Tetsuo; Ikehara, Satoru; Tsukahara, Nobuyuki
Author_Institution :
Fac. of Eng., Fukui Univ., Japan
Abstract :
This paper proposes methods for detecting and correcting characters that are wrongly inserted or deleted in natural-language text. Natural language is physically different from DNA, but the two share many characteristics as media for representing information. Accordingly, the methods proposed here are expected to be applicable to detecting errors in DNA chains. In optical character recognition and continuous speech recognition of natural language, it has been difficult to detect characters that are wrongly deleted or inserted. To detect and correct these errors, this paper proposes new methods that use an m-th order Markov chain model for Japanese syllables and "kanji-kana" characters, assuming that the Markov probability of a correct chain of syllables or "kanji-kana" characters is greater than that of an erroneous chain. From the experimental results, it is concluded that the methods are useful for detecting as well as correcting these errors.
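The assumption stated in the abstract, that a correct chain has a higher Markov probability than an erroneous one, can be illustrated with a minimal sketch. This is not the authors' implementation. The toy English corpus, the add-one smoothing, the per-transition length normalization, and the function names (train, avg_log_prob, best_single_deletion) are all assumptions made for illustration, and only the wrongly-inserted-character case is shown.

```python
import math
from collections import defaultdict


def train(corpus, m):
    """Count (context, next_char) pairs for an m-th order character chain."""
    counts = defaultdict(lambda: defaultdict(int))
    alphabet = set()
    for text in corpus:
        # "^" pads the start and "$" marks the end (hypothetical markers).
        padded = "^" * m + text + "$"
        alphabet.update(padded)
        for i in range(m, len(padded)):
            counts[padded[i - m:i]][padded[i]] += 1
    return counts, alphabet


def avg_log_prob(text, counts, alphabet, m):
    """Average log-probability per transition, with add-one smoothing.

    Averaging is an assumption of this sketch: it keeps a shorter
    candidate from winning merely because it has fewer transitions.
    """
    padded = "^" * m + text + "$"
    total = 0.0
    for i in range(m, len(padded)):
        context = counts.get(padded[i - m:i], {})
        numerator = context.get(padded[i], 0) + 1
        denominator = sum(context.values()) + len(alphabet)
        total += math.log(numerator / denominator)
    return total / (len(padded) - m)


def best_single_deletion(text, counts, alphabet, m):
    """Try deleting each character; keep the candidate that scores highest.

    Returns (score, index, corrected_text). The index is None when no
    deletion scores higher than the input itself.
    """
    best = (avg_log_prob(text, counts, alphabet, m), None, text)
    for i in range(len(text)):
        candidate = text[:i] + text[i + 1:]
        score = avg_log_prob(candidate, counts, alphabet, m)
        if score > best[0]:
            best = (score, i, candidate)
    return best


# Toy training data; a real system would use a large Japanese corpus.
corpus = [
    "the cat sat on the mat",
    "the cat ate the rat",
    "the rat sat on the cat",
]
counts, alphabet = train(corpus, m=2)

# "x" is a wrongly inserted character at index 6.
score, index, corrected = best_single_deletion("the caxt sat", counts, alphabet, m=2)
print(index, corrected)
```

A wrongly deleted character could be handled symmetrically by trying insertions of each alphabet symbol at each position and comparing scores in the same way. That extension is likewise an assumption of this sketch, not a description of the paper's procedure.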
Keywords :
DNA; Markov processes; biology computing; cellular biophysics; medical administrative data processing; natural languages; optical character recognition; speech recognition; DNA chains; Japanese strings; Japanese syllables; Markov chain model; Markov probability; character correction; continuous speech recognition; error characters; error correction; kanji-kana characters; natural language; syllable chain; wrongly deleted character detection
Conference_Title :
System Sciences, 1994. Proceedings of the Twenty-Seventh Hawaii International Conference on
Conference_Location :
Wailea, HI, USA
Print_ISBN :
0-8186-5090-7
DOI :
10.1109/HICSS.1994.323582