DocumentCode
3325266
Title
Optimal techniques in OCR error correction for Japanese texts
Author
Hisamitsu, Toru ; Marukawa, Katsumi ; Shima, Yoshihiro ; Fujisawa, Hiromichi ; Nitta, Yoshihiko
Author_Institution
Adv. Res. Lab., Hitachi Ltd., Saitama, Japan
Volume
2
fYear
1995
fDate
14-16 Aug 1995
Firstpage
1014
Abstract
This paper investigates three fundamental techniques in OCR error correction for Japanese texts using morphological analysis: (1) an optimal method for extracting candidate words from a candidate character lattice, (2) optimal word entries for Japanese verb inflection analysis, and (3) a new method of calculating word matching cost that is better suited to use with linguistic criteria. Comparative evaluation shows that the combination of these techniques requires 84% less computation, captures 2.6% more candidate words, reduces the chart parsing computation by 20%, and attains a 25% higher error correction rate than a commonly used method.
Keywords
error correction; grammars; optical character recognition; Japanese texts; Japanese verb inflection analysis; OCR error correction; candidate character lattice; candidate word extraction; chart parsing computation; linguistic criteria; morphological analysis; optimal method; optimal techniques; optimal word entries; word matching cost calculation; Character generation; Cost function; Dictionaries; Distributed decision making; Error analysis; Error correction; Laboratories; Lattices; Optical character recognition software; Text recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location
Montreal, Que.
Print_ISBN
0-8186-7128-9
Type
conf
DOI
10.1109/ICDAR.1995.602074
Filename
602074