DocumentCode
381373
Title
Multi-layered videotext extraction method
Author
Agnihotri, Lalitha ; Dimitrova, Nevenka ; Soletic, Mario
Author_Institution
Philips Res. USA, Briarcliff Manor, NY, USA
Volume
2
fYear
2002
fDate
2002
Firstpage
213
Abstract
The importance of video content analysis and retrieval increases as storage capacity grows in consumer devices. Videotext detection is very important for video indexing and retrieval enabling features such as commercial detection, intelligent keyframe extraction, program boundary detection and others. In this paper we propose multi-layered videotext detection tailored to an MP EG encoding scheme. We approached the videotext detection from a perspective of scalability and flexibility for different platforms with varying resources. We propose a three-layered algorithm. The first layer works in compressed domain features such as macroblock type. The second layer works in semicompressed domain such as DCT coefficients. The third layer works in uncompressed domain, i.e. spatial domain. As the next layer gets implemented the complexity increases but so does the precision of the algorithm. On constrained platforms, only the first layer would be implemented. On high-end platforms all three layers could be implemented to enable a full suite of indexing and retrieval applications.
Keywords
content-based retrieval; discrete cosine transforms; feature extraction; image retrieval; indexing; information retrieval; transform coding; video coding; viewdata; DCT coefficients; MPEG encoding; commercial detection; complexity; compressed domain features; constrained platforms; flexibility; intelligent keyframe extraction; macroblock type; multi-layered videotext extraction method; program boundary detection; scalability; semicompressed domain; spatial domain; three-layered algorithm; uncompressed domain; video content analysis; video indexing; video retrieval; videotext detection; Algorithm design and analysis; Change detection algorithms; Content based retrieval; Encoding; Face detection; Image color analysis; Indexing; Layout; Performance analysis; Transform coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7803-7304-9
Type
conf
DOI
10.1109/ICME.2002.1035552
Filename
1035552
Link To Document