  • DocumentCode
    739697
  • Title
    Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images
  • Author
    Guozhu Liang; Palaiahnakote Shivakumara; Tong Lu; Chew Lim Tan
  • Author_Institution
    National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
  • Volume
    24
  • Issue
    11
  • fYear
    2015
  • Firstpage
    4488
  • Lastpage
    4501
  • Abstract
    Detecting scene text in video and natural scene images is challenging because of variations in background, contrast, text type, font type, font size, and other factors. Arbitrary text orientations and multiple scripts add further complexity. The proposed approach introduces the idea of convolving the Laplacian with wavelet sub-bands at different levels in the frequency domain to enhance low-resolution text pixels. The results obtained from the different sub-bands (spectral bands) are then fused to detect candidate text pixels. We use maximally stable extremal regions (MSER) together with the stroke width transform to detect candidate text regions. Text alignment is based on the distance between the nearest-neighbor clusters of candidate text regions. In addition, the approach presents a new symmetry-driven nearest-neighbor method for restoring full text lines. We conduct experiments on our collected video data as well as several benchmark data sets, such as ICDAR 2011, ICDAR 2013, and MSRA-TD500, to evaluate the proposed method. The approach is compared with state-of-the-art methods to show its superiority.
  • Keywords
    image fusion; text detection; video signal processing; wavelet transforms; Laplacian; arbitrarily oriented scene text detection; maxima stable extreme regions; multiscripts; multispectral fusion; natural scene images; text pixel resolution; video images; wavelet subbands; Image color analysis; Image edge detection; Image resolution; Laplace equations; Streaming media; Text recognition; Transforms; Laplacian-wavelet; Multi spectral fusion; arbitrarily oriented video text detection; stroke width transform
  • fLanguage
    English
  • Journal_Title
    IEEE Transactions on Image Processing
  • Publisher
    IEEE
  • ISSN
    1057-7149
  • Type
    jour
  • DOI
    10.1109/TIP.2015.2465169
  • Filename
    7180356