DocumentCode
2154792
Title
Compression using self-similarity-based temporal super-resolution for full-exposure-time video
Author
Shimano, Mihoko ; Cheung, Gene ; Sato, Imari
Author_Institution
PRESTO, Univ. of Tokyo, Tokyo, Japan
fYear
2011
fDate
22-27 May 2011
Firstpage
1053
Lastpage
1056
Abstract
In order to allow sufficient amount of light into the image sen sor, videos captured in poor lighting conditions typically have low frame rate and frame exposure time equals to inter-frame period-commonly called full exposure time (FET). FET low-frame-rate videos are common in situations where lighting cannot be improved a priori due to practical (e.g., large physical distance between camera and captured objects) or economical (e.g., long duration of night-time surveillance) reasons. Previous work in computer vision has shown that content at a desired higher frame rate can be recovered (to some extent) from the captured FET video using self-similarity-based temporal super-resolution. From an end-to-end communication standpoint, however, the following practical question remains: what is the most compact representation of the captured FET video at encoder, given that a higher frame rate reconstruction is desired at the decoder? In this paper, we present a compression strategy, where, for a given targeted rate-distortion (RD) tradeoff, FET video frames at appropriate temporal resolutions are selected for encoding using standard H.264 tools at encoder. At the decoder, temporal super-resolution is performed on the decoded frames to synthesize the desired high frame rate video. We formulate the selection of individual FET frames at different temporal resolutions as a shortest path problem to minimize Lagrangian cost of the encoded sequence. Then, we propose a computation-efficient algorithm based on monotonicity in predictor´s temporal resolution to find the shortest path. Experiments show that our strategy outperforms an alternative naive approach of encoding all FET frames as is and performing temporal super-resolution at decoder by up to 1.1dB at the same bitrate.
Keywords
computer vision; data compression; image reconstruction; image representation; image resolution; image sensors; video coding; FET low frame rate video; H.264 tools; Lagrangian cost; bit rate; computation efficient algorithm; computer vision; end-to-end communication standpoint; frame exposure time; full exposure time video; high frame rate reconstruction; image sensor; lighting condition; rate distortion tradeoff; self similarity-based temporal super resolution; video capture; Bit rate; Decoding; Encoding; FETs; Spatial resolution; Strontium; Video compression; self similarity; super-resolution;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5946588
Filename
5946588
Link To Document