DATE 2017

Design Space Exploration of FPGA-Based Accelerators with Multi-Level Parallelism

Guanwen Zhong^1,a, Alok Prakash², Siqi Wang^1,b, Yun Liang³, Tulika Mitra^1,c and Smail Niar⁴
¹School of Computing, National University of Singapore.
^aguanwen@comp.nus.edu.sg
^bsiqi@comp.nus.edu.sg
^ctulika@comp.nus.edu.sg
²SCSE, Nanyang Technological University.
alok@ntu.edu.sg
³School of EECS, Peking University, China.
ericlyun@pku.edu.cn
⁴LAMIH, University of Valenciennes, France.
smail.niar@univ-valenciennes.fr

ABSTRACT

Applications containing compute-intensive kernels with nested loops can effectively leverage FPGAs to exploit fineand coarse-grained parallelism. HLS tools used to translate these kernels from high-level languages (e.g., C/C++), however, are inefficient in exploiting multiple levels of parallelism automatically, thereby producing sub-optimal accelerators. Moreover, the large design space resulting from the various combinations of fineand coarse-grained parallelism options makes exhaustive design space exploration prohibitively time-consuming with HLS tools. Hence, we propose a rapid estimation framework, MPSeeker, to evaluate performance/area metrics of various accelerator options for an application at an early design phase. Experimental results show that MPSeeker can rapidly (in minutes) explore the complex design space and accurately estimate performance/area of various design points to identify the near-optimal (95.7% performance of the optimal on average) combination of parallelism options.

Full Text (PDF)