مرکز منطقه ای اطلاع رساني علوم و فناوري - Semantic keyword extraction via adaptive text binarization of unstructured unsourced video

DocumentCode :

3634912

Title :

Semantic keyword extraction via adaptive text binarization of unstructured unsourced video

Author :

Michele Merler;John R. Kender

Author_Institution :

Department of Computer Science, Columbia University, USA

fYear :

2009

Firstpage :

261

Lastpage :

264

Abstract :

We propose a fully automatic method for summarizing and indexing unstructured presentation videos based on text extracted from the projected slides. We use changes of text in the slides as a means to segment the video into semantic shots. Unlike precedent approaches, our method does not depend on availability of the electronic source of the slides, but rather extracts and recognizes the text directly from the video. Once text regions are detected within keyframes, a novel binarization algorithm, Local Adaptive Otsu (LOA), is employed to deal with the low quality of video scene text, before feeding the regions to the open source Tesseract1 OCR engine for recognition. We tested our system on a corpus of 8 presentation videos for a total of 1 hour and 45 minutes, achieving 0.5343 Precision and 0.7446 Recall Character recognition rates, and 0.4947 Precision and 0.6651 Recall Word recognition rates. Besides being used for multimedia documents, topic indexing, and cross referencing, our system can be integrated into summarization and presentation tools such as the VAST MultiMedia Browser [1].

Keywords :

"Text recognition","Layout","Optical character recognition software","Engines","Data mining","Audio recording","Video recording","Indexing","Character recognition","Multimedia systems"

Publisher :

ieee

Conference_Titel :

Image Processing (ICIP), 2009 16th IEEE International Conference on

ISSN :

1522-4880

Print_ISBN :

978-1-4244-5653-6

Electronic_ISBN :

2381-8549

Type :

conf

DOI :

10.1109/ICIP.2009.5413432

Filename :

5413432

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3634912