A Compaction Method for STLs for GPU in-field test

Juan-David Guerrero-Balagueraa, Josie E. Rodriguez Condiab and Matteo Sonza Reordac
Department of Control and Computer Engineering Politecnico di Torino, Torino, Italy
ajuan.guerrero@polito.it
bjosie.rodriguez@polito.it
cmatteo.sonzareorda@polito.it

ABSTRACT


Nowadays, Graphics Processing Units (GPUs) are effective platforms for implementing complex algorithms (e.g., for Artificial Intelligence) in different domains (e.g., automotive and robotics), where massive parallelism and high computational effort are required. In some domains, strict safety-critical requirements exist, mandating the adoption of mechanisms to detect faults during the operational phases of a device. An effective test solution is based on Self-Test Libraries (STLs) aiming at testing devices functionally. This solution is frequently adopted for CPUs, but can also be used with GPUs. Nevertheless, the in-field constraints restrict the size and duration of acceptable STLs. This work proposes a method to automatically compact the test programs of a given STL targeting GPUs. The proposed method combines a multi-level abstraction analysis resorting to logic simulation to extract the microarchitectural operations triggered by the test program and the information about the thread-level activity of each instruction and to fault simulation to know its ability to propagate faults to an observable point. The main advantage of the proposed method is that it requires a single fault simulation to perform the compaction. The effectiveness of the proposed approach was evaluated, resorting to several test programs developed for an open-source GPU model (FlexGripPlus) compatible with NVIDIA GPUs. The results show that the method can compact test programs by up to 98.64% in code size and by up to 98.42% in terms of duration, with minimum effects on the achieved fault coverage.

Keywords: Functional Testing, Graphics Processing Units (GPUs), Self-Test Libraries (STLs), Test Compaction.



Full Text (PDF)