Title :
Compression using self-similarity-based temporal super-resolution for full-exposure-time video
Author :
Shimano, Mihoko ; Cheung, Gene ; Sato, Imari
Author_Institution :
PRESTO, Univ. of Tokyo, Tokyo, Japan
Abstract :
In order to allow sufficient amount of light into the image sen sor, videos captured in poor lighting conditions typically have low frame rate and frame exposure time equals to inter-frame period-commonly called full exposure time (FET). FET low-frame-rate videos are common in situations where lighting cannot be improved a priori due to practical (e.g., large physical distance between camera and captured objects) or economical (e.g., long duration of night-time surveillance) reasons. Previous work in computer vision has shown that content at a desired higher frame rate can be recovered (to some extent) from the captured FET video using self-similarity-based temporal super-resolution. From an end-to-end communication standpoint, however, the following practical question remains: what is the most compact representation of the captured FET video at encoder, given that a higher frame rate reconstruction is desired at the decoder? In this paper, we present a compression strategy, where, for a given targeted rate-distortion (RD) tradeoff, FET video frames at appropriate temporal resolutions are selected for encoding using standard H.264 tools at encoder. At the decoder, temporal super-resolution is performed on the decoded frames to synthesize the desired high frame rate video. We formulate the selection of individual FET frames at different temporal resolutions as a shortest path problem to minimize Lagrangian cost of the encoded sequence. Then, we propose a computation-efficient algorithm based on monotonicity in predictor´s temporal resolution to find the shortest path. Experiments show that our strategy outperforms an alternative naive approach of encoding all FET frames as is and performing temporal super-resolution at decoder by up to 1.1dB at the same bitrate.
Keywords :
computer vision; data compression; image reconstruction; image representation; image resolution; image sensors; video coding; FET low frame rate video; H.264 tools; Lagrangian cost; bit rate; computation efficient algorithm; computer vision; end-to-end communication standpoint; frame exposure time; full exposure time video; high frame rate reconstruction; image sensor; lighting condition; rate distortion tradeoff; self similarity-based temporal super resolution; video capture; Bit rate; Decoding; Encoding; FETs; Spatial resolution; Strontium; Video compression; self similarity; super-resolution;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946588