Title :
Patra: Parallel tree-reweighted message passing architecture
Author :
Wenlai Zhao ; Haohuan Fu ; Guangwen Yang ; Luk, Wayne
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
Maximum a posteriori probability inference algorithms for Markov Random Field are widely used in many applications, such as computer vision and machine learning. Sequential tree-reweighted message passing (TRW-S) is an inference algorithm which shows good quality in finding optimal solutions. However, the performance of TRW-S in software cannot meet the requirements of many real-time applications, due to the sequential scheme and the high memory, bandwidth and computational costs. This paper proposes Patra, a novel parallel tree-reweighted message passing architecture, which involves a fully pipelined design targeting FPGA technology. We build a hybrid CPU/FPGA system to test the performance of Patra for stereo matching. Experimental results show that Patra provides about 100 times faster than a software implementation of TRW-S, and 12 times faster than a GPU-based message passing algorithm. Compared with an existing design in four FPGAs, we can achieve 2 times speedup in a single FPGA. Moreover, Patra can work at video rate in many cases, such as a rate of 167 frame/sec for a standard stereo matching test case, which makes it promising for many real-time applications.
Keywords :
Markov processes; field programmable gate arrays; image matching; logic design; maximum likelihood estimation; message passing; parallel architectures; probability; stereo image processing; trees (mathematics); FPGA technology; GPU-based message passing algorithm; Markov random field; Patra; TRW-S; computational costs; computer vision; fully pipelined design; hybrid CPU-FPGA system; machine learning; maximum a posteriori probability inference algorithms; parallel tree-reweighted message passing architecture; sequential tree-reweighted message passing; stereo matching; video rate; Accuracy; Algorithm design and analysis; Bandwidth; Field programmable gate arrays; Message passing; Random access memory; Vectors; FPGA; Markov Random Field; Maximum a posteriori probability; Parallel Tree-Reweighted Message Passing;
Conference_Titel :
Field Programmable Logic and Applications (FPL), 2014 24th International Conference on
Conference_Location :
Munich
DOI :
10.1109/FPL.2014.6927506