Title :
Gaussian Mixture Modeling of Neighbor Characters for Multilingual Text Extraction in Images
Author :
Hui Fu ; Xiabi Liu ; Yunde Jia ; Hongbin Deng
Author_Institution :
Dept. of Comput. Sci. & Eng., Beijing Inst. of Technol., China
Abstract :
This paper proposes a new method for extracting multilingual text from images by discriminating characters from non-characters based on Gaussian mixture modeling of neighbor characters. The image is binarized, and a morphological closing operation is applied to the binary image so that each character can be treated as a connected component. The neighborhood of each connected component is computed from the Voronoi partition of the image, and each connected component is labeled as character or non-character according to its neighbors. We applied the proposed method to Chinese and English text extraction, and the experimental results confirm its effectiveness.
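The pipeline outlined in the abstract (binarization, morphological closing, connected components, Voronoi-based neighborhoods, mixture-based scoring) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The abstract does not give the threshold, structuring element, Voronoi construction, neighbor features, or mixture parameters, so everything below is an assumption: a fixed threshold of 128, a 3x3 structuring element, a discrete city-block approximation of the Voronoi partition, and a one-dimensional mixture over a hypothetical scalar feature. All function names are hypothetical.

```python
from collections import deque
import math

def binarize(gray, threshold=128):
    """Assumed rule: pixels darker than the threshold are foreground (1)."""
    return [[1 if v < threshold else 0 for v in row] for row in gray]

def _morph(binary, op):
    """Apply op (max = dilation, min = erosion) over a 3x3 window clipped at the border."""
    h, w = len(binary), len(binary[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [binary[ny][nx]
                      for ny in range(max(0, y - 1), min(h, y + 2))
                      for nx in range(max(0, x - 1), min(w, x + 2))]
            out[y][x] = op(window)
    return out

def closing(binary):
    """Morphological closing: dilation followed by erosion."""
    return _morph(_morph(binary, max), min)

def label_components(binary):
    """8-connected labeling. Returns (label grid with 0 = background, component count)."""
    h, w = len(binary), len(binary[0])
    labels = [[0] * w for _ in range(h)]
    count = 0
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not labels[y][x]:
                count += 1
                labels[y][x] = count
                queue = deque([(y, x)])
                while queue:
                    cy, cx = queue.popleft()
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            ny, nx = cy + dy, cx + dx
                            if (0 <= ny < h and 0 <= nx < w
                                    and binary[ny][nx] and not labels[ny][nx]):
                                labels[ny][nx] = count
                                queue.append((ny, nx))
    return labels, count

def voronoi_neighbors(labels):
    """Discrete stand-in for a Voronoi partition: grow every component at once
    (multi-source BFS, city-block metric), then report which regions touch."""
    h, w = len(labels), len(labels[0])
    owner = [row[:] for row in labels]
    queue = deque((y, x) for y in range(h) for x in range(w) if owner[y][x])
    while queue:
        cy, cx = queue.popleft()
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = cy + dy, cx + dx
            if 0 <= ny < h and 0 <= nx < w and owner[ny][nx] == 0:
                owner[ny][nx] = owner[cy][cx]
                queue.append((ny, nx))
    pairs = set()
    for y in range(h):
        for x in range(w):
            for dy, dx in ((1, 0), (0, 1)):
                ny, nx = y + dy, x + dx
                if ny < h and nx < w:
                    a, b = owner[y][x], owner[ny][nx]
                    if a != b:
                        pairs.add((min(a, b), max(a, b)))
    return pairs

def gmm_density(x, components):
    """1-D Gaussian mixture density; components is a list of (weight, mean, std).
    The feature x and the parameters are hypothetical, not taken from the paper."""
    return sum(w * math.exp(-0.5 * ((x - m) / s) ** 2) / (s * math.sqrt(2 * math.pi))
               for w, m, s in components)
```

In such a sketch, a component would be labeled as a character when a feature computed from its Voronoi neighbors (for example, a size ratio) receives a sufficiently high density under a mixture fitted to known character pairs; the choice of feature and decision threshold is left open here because the abstract does not specify them.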
Keywords :
Gaussian processes; character recognition; computational geometry; document image processing; feature extraction; image morphing; linguistics; text analysis; Chinese text extraction; English text extraction; Gaussian mixture modeling; Voronoi partition; character discrimination; morphological closing operation; multilingual text extraction; Computer science; Document image processing; Gaussian distribution; Image color analysis; Image recognition; Image texture analysis; Indexing; Optical character recognition software; Testing; Text recognition; Gaussian distributions
Conference_Title :
Image Processing, 2006 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
1-4244-0480-0
DOI :
10.1109/ICIP.2006.312883