Title :
Image character recognition using deep convolutional neural network learned from different languages
Author :
Jinfeng Bai ; Zhineng Chen ; Bailan Feng ; Bo Xu
Author_Institution :
Interactive Digital Media Technol. Res. Center, Inst. of Autom., Beijing, China
Abstract :
This paper proposes a shared-hidden-layer deep convolutional neural network (SHL-CNN) for image character recognition. In SHL-CNN, the hidden layers are made common across characters from different languages, performing a universal feature extraction process that aims at learning common character traits existed in different languages such as strokes, while the final softmax layer is made language dependent, trained based on characters from the destination language only. This paper is the first attempt to introduce the SHL-CNN framework to image character recognition. Under the SHL-CNN framework, we discuss several issues including architecture of the network, training of the network, from which a suitable SHL-CNN model for image character recognition is empirically learned. The effectiveness of the learned SHL-CNN is verified on both English and Chinese image character recognition tasks, showing that the SHL-CNN can reduce recognition errors by 16-30% relatively compared with models trained by characters of only one language using conventional CNN, and by 35.7% relatively compared with state-of-the-art methods. In addition, the shared hidden layers learned are also useful for unseen image character recognition tasks.
Keywords :
character recognition; feature extraction; image recognition; learning (artificial intelligence); natural language processing; neural nets; Chinese image character recognition; English image character recognition; SHL-CNN framework; SHL-CNN model; character traits; destination language; hidden layers; shared-hidden-layer deep convolutional neural network; softmax layer; universal feature extraction process; unseen image character recognition; Character recognition; Databases; Feature extraction; Image recognition; Neural networks; Optical character recognition software; Training; deep convolutional neural network; image character recognition; multi-task learning;
Conference_Titel :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
DOI :
10.1109/ICIP.2014.7025518