• DocumentCode
    3296946
  • Title
    From Text Detection in Videos to Person Identification
  • Author
    Poignant, Johann ; Besacier, Laurent ; Quénot, Georges ; Thollard, Franck
  • Author_Institution
    Grenoble INP, LIG, UJF-Grenoble 1/UPMF-Grenoble 2, Grenoble, France
  • fYear
    2012
  • fDate
    9-13 July 2012
  • Firstpage
    854
  • Lastpage
    859
  • Abstract
    We present a video OCR system that detects and recognizes overlaid text in video, together with its application to person identification in video documents. The system proceeds in several steps. First, text detection and temporal tracking are performed. After the images are adapted for a standard OCR system, a final post-processing step combines multiple transcriptions of the same text box. A semi-supervised adaptation of the system to a particular video type (broadcast video from French TV) is proposed and evaluated. The system is efficient: it runs 3 times faster than real time (including the OCR step) on a desktop Linux box. Text detection and recognition are evaluated both individually and through a person recognition task, which shows that combining OCR and audio (speaker) information can greatly improve the performance of a state-of-the-art audio-based person identification system.
  • Keywords
    text detection; video signal processing; French TV; state of the art audio based person identification system; audio information; desktop Linux box; images adaptation; overlaid texts detection; overlaid texts recognition; semi-supervised adaptation; speaker information; standard OCR system; temporal tracking; video OCR system; video broadcast; video documents; Error analysis; Humans; Optical character recognition software; Speech recognition; TV; Text recognition; Videos; Video OCR; person identification; semi-supervised parametrization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Multimedia and Expo (ICME), 2012 IEEE International Conference on
  • Conference_Location
    Melbourne, VIC
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4673-1659-0
  • Type
    conf
  • DOI
    10.1109/ICME.2012.119
  • Filename
    6298510