Title :
Thai OCR: a neural network application
Author :
Tanprasert, Chularat ; Koanantakool, Thaweesak
Author_Institution :
Nat. Electron. & Comput. Technol. Center, Nat. Sci. & Technol Dev. Agency, Bangkok, Thailand
Abstract :
Thai optical character recognition (Thai OCR) is one of the most desirable computer applications in Thailand at present. Though there are several proposed techniques for solving the problem, none seems to produce a satisfactory practical result. Many limited factors are encountered such as the inconsistency of the scanning process, the incomplete and/or noisy original documents, and the shift and position variance of the recognition technique. We proposed to apply artificial neural networks (ANNs) together with some pre-processing and post-processing techniques to solve the Thai OCR problem. The experimental results confirm that ANNs are a very suitable technique for developing the Thai OCR software. The recognition rate on a real document of training fonts is about 90%-95%. This leads to a possible implementation in a production-quality OCR software that the NECTEC software technology laboratory is working on. The details of all processes are explained in the paper
Keywords :
backpropagation; document image processing; multilayer perceptrons; optical character recognition; NECTEC software technology laboratory; Thai OCR; Thailand; artificial neural networks; backpropagation learning algorithm; incomplete original documents; multilayer perceptron; neural network application; noisy original documents; optical character recognition; position variance; post-processing techniques; pre-processing techniques; production-quality OCR software; recognition rate; scanning process inconsistency; shift variance; training fonts; Character recognition; Cleaning; Engines; Image segmentation; Natural languages; Neural networks; Optical character recognition software; Shape; Tail; Testing;
Conference_Titel :
TENCON '96. Proceedings., 1996 IEEE TENCON. Digital Signal Processing Applications
Conference_Location :
Perth, WA
Print_ISBN :
0-7803-3679-8
DOI :
10.1109/TENCON.1996.608717