Abstract :
For the gray-level image of government resource document, a linear transform was employed to enhance the image contrast. A spatial filter was applied to eliminate image noise. After this preprocessing, the threshold surface T1 was computed by Bernsen algorithm, and the global threshold T2 was calculated by modified Otsu method. On the basis of T1 and T2, other three thresholds were defined, which include the broken stroke value T3, the average value T4 of neighborhood and the union value T5 between global and local. Then the gray-level image was binarized through the combination of these five values. Our experiments showed that the proposed method using these five thresholds was adaptive to various government documents. By ghost artifacts eliminating and the broken strokes mending, it´s benefit to OCR.