A Hybrid Approach for Detection and Correction of Transient Faults in SoCs

作者:Bernardi P*; Poehls L M Bolzani; Grosso M; Reorda M Sonza
来源:IEEE Transactions on Dependable and Secure Computing, 2010, 7(4): 439-445.
DOI:10.1109/2010.33

摘要

Critical applications based on Systems-on-Chip (SoCs) require suitable techniques that are able to ensure a sufficient level of reliability. Several techniques have been proposed to improve fault detection and correction capabilities of faults affecting SoCs. This paper proposes a hybrid approach able to detect and correct the effects of transient faults in SoC data memories and caches. The proposed solution combines some software modifications, which are easy to automate, with the introduction of a hardware module, which is independent of the specific application. The method is particularly suitable to fit in a typical SoC design flow and is shown to achieve a better trade-off between the achieved results and the required costs than corresponding purely hardware or software techniques. In fact, the proposed approach offers the same fault-detection and -correction capabilities as a purely software-based approach, while it introduces nearly the same low memory and performance overhead of a purely hardware-based one.

  • 出版日期2010-12