Title :
A cache-aware motion estimation organization for a hardware-based H.264 encoder
Author :
Deumji Woo ; Chae Rhee ; Hyuk-Jae Lee
Author_Institution :
Next Generation AP team, Electron., Yongin, South Korea
Abstract :
The video resolution required for many types of video content has increased as technology has advanced. For real-time encoding at high resolutions such as full high definition (FHD), quad-FHD (QFHD) and beyond, various fast motion estimation (ME) algorithms have been researched. In a hardware-based encoder, many fast ME algorithms use caches in order to increase local memory utilization and thereby reduce external memory access. However, most previous works do not pay attention to the amount of cache access from multiple MEs. In a multi-core environment for high-resolution videos, access conflicts directly affect the computation time. In this paper, various types of caches are compared in terms of size, hit ratio, cache port conflicts and hardware overhead. To reduce the amount of cache access associated with the basic shared cache, zigzag snake scan and selective data-storage schemes are proposed for integer and fractional MEs, respectively. Additionally, cache access arbitration hides the computation delay that arises from a cache port conflict in a pipeline system. The proposed schemes can be applied to existing cache designs and achieve good scalability in a multi-core environment. Simulation results show that the ME computation time achieved with the proposed schemes is comparable to that of the dual-port shared cache, which shows the fewest port conflicts.
Keywords :
cache storage; code standards; motion estimation; pipeline processing; shared memory systems; video coding; ME computation time reduction; access conflict; cache access arbitration; cache aware motion estimation organization; cache design; cache port conflict; computation delay hiding; data storage scheme; dual port shared cache; external memory access reduction; fractional ME; hardware-based H.264 encoder; integer ME; local memory utilization; multicore environment; pipeline system; video content; video resolution; zigzag snake scan; Engines; Hardware; Interpolation; Loading; Organizations; Ports (Computers); Streaming media; Cache; External memory access; H.264; Internal memory access; Motion Estimation;
Journal_Title :
Consumer Electronics, IEEE Transactions on
DOI :
10.1109/TCE.2014.6780929