• DocumentCode
    1638017
  • Title

    A Robust Wavelet Transform Based Technique for Video Text Detection

  • Author

    Shivakumara, Palaiahnakote ; Phan, Trung Quy ; Tan, Chew Lim

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2009
  • Firstpage
    1285
  • Lastpage
    1289
  • Abstract
    In this paper, we propose a new method based on wavelet transform, statistical features and central moments for both graphics and scene text detection in video images. The method uses wavelet single level decomposition LH, HL and HH subbands for computing features and the computed features are fed to k means clustering to classify the text pixel from the background of the image. The average of wavelet subbands and the output of k means clustering helps in classifying true text pixel in the image. The text blocks are detected based on analysis of projection profiles. Finally, we introduce a few heuristics to eliminate false positives from the image. The robustness of the proposed method is tested by conducting experiments on a variety of images of low contrast, complex background, different fonts, and size of text in the image. The experimental results show that the proposed method outperforms the existing methods in terms of detection rate, false positive rate and misdetection rate.
  • Keywords
    computer graphics; image classification; object detection; pattern clustering; statistical analysis; text analysis; video signal processing; wavelet transforms; computer graphics; image HH text detection; image HL text detection; image LH text detection; image classification; k means-clustering; projection profile analysis; robust wavelet transform; statistical feature; text pixel; video image text detection technique; wavelet single level decomposition; Graphics; Image analysis; Image edge detection; Image recognition; Layout; Pixel; Robustness; Text analysis; Wavelet analysis; Wavelet transforms; k-means clsutering; statistical features; text detection; wavelet subbands;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.83
  • Filename
    5277693