• DocumentCode
    2466731
  • Title

    Text localization, enhancement and binarization in multimedia documents

  • Author

    Wolf, Christian ; Jolion, Jean-Michel ; Chassaing, Françoise

  • Author_Institution
    Lab. Reconnaissance de Formes et Vision, Inst. Nat. des Sci. Appliquees de Lyon, Villeurbanne, France
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    1037
  • Abstract
    The systems currently available for content based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge into the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by key word based queries. In this paper we present an algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text. The quality of the localized text is improved by robust multiple frame integration. Anew technique for the binarization of the text boxes is proposed. Finally, detection and OCR results for a commercial OCR are presented.
  • Keywords
    content-based retrieval; image retrieval; image sequences; interpolation; multimedia databases; optical character recognition; binarization; content based image and video retrieval; images sequences; indexing process; morphological post processing; multimedia documents; optical character recognition; robust multiple frame integration; semantic knowledge; text enhancement; text localization; video sequences; Content based retrieval; Data mining; Feature extraction; Humans; Image processing; Image retrieval; Indexing; Information retrieval; Optical character recognition software; Video sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2002. Proceedings. 16th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-1695-X
  • Type

    conf

  • DOI
    10.1109/ICPR.2002.1048482
  • Filename
    1048482