Reducing Soft Errors in Register File by using Self-Immunity Technique
Abstract
With the scaling of technology, Transient errors caused
by external particle strikes have become a critical challenge for
microprocessor design. As embedded processors are widely used in
reliability-sensitive environments. It becomes important to develop
the cost-effective techniques to improve the processor reliability
against soft errors. This paper focuses on studying the register file
immunity against soft errors. Register file is one of the essential
architecture element. where soft errors can be very harmful because
errors may spread from there throughout the whole system. This paper
addresses a novel technique called Self-Immunity that improves the
integrity of the register file against soft errors. Certain number of
register bits are not always used to represent a value stored in a
register. This paper deals with the difficulty to exploit obvious
observation to enhance the register file integrity against soft errors.
The Self-Immunity technique can reduce the vulnerability of the
register file in terms of area and power overheads.
Keywords
Full Text:
PDFReferences
Greg Bronevetsky and Bronis R. de Supinski, ”Soft Error Vulnerability of
Iterative Linear Algebra Methods,” in the 22nd annual international
conference on Supercomputing, pp. 155-164, 2008.
J.L. Autran, P. Roche, S. Sauze, G. Gasiot, D. Munteanu, P. Loaiza, M.
Zampaolo and J. Borel, “Real-time neutron and alpha soft-error rate
testing of CMOS 130nm SRAM: Altitude versus underground
measurements,” in ICICDT‘08, pp. 233–236, 2008.
S.S. Mukherjee, C. Weaver, J. Emer, S.K. Reinhardt and T. Austin, “A
Systematic Methodology to Compute the Architectural Vulnerability
Factors for a High-Performance Microprocessor,” in International
Symposium on Microarchitecture (MICRO-36), pp.29- 40, 2003.
Jongeun Lee and Aviral Shrivastava, “A Compiler Optimization to
Reduce Soft Errors in Register Files,” in LCTES 2009.
Jason A. Blome, Shantanu Gupta, Shuguang Feng, and Scott Mahlke,
“Cost-efficient soft error protection for embedded microprocessors,” in
CASES ’06, pp. 421–431, 2006.
I. Koren and C. M. Krishna, Fault-Tolerant Systems. San Mateo, CA:
Morgan Kaufmann, 2007.
MiBench (http://www.eecs.umich.edu/mibench/).
M. Fazeli, A. Namazi, and S.G. Miremadi “An energy efficient circuit
level technique to protect register file from MBUs and SETs in embedded
processors,” in Dependable Systems & Networks 2009, pp.195–204,
DNS’09.
K. Walther, C. Galke and H.T. VIERHAUS, “On-Line Techniques for
Error Detection and Correction in Processor Registers with Cross-Parity
Check,” in Journal of Electronic Testing: Theory and Applications 19,
pp.501-510, 2003.
M. Kandala, W. Zhang, and L. Yang, “An area-efficient approach to
improving register file reliability against transient errors,” in Advanced
Information Networking and Applications Workshops, AINAW '07, pp.
–803, 2007.
http://archc.sourceforge.net/.
Jun Yan and Wei Zhang, “Compiler-guided register reliability
improvement against soft errors,” in EMSOFT ’05, pp. 203–209, 2005.
Jongeun Lee and A. Shrivastava, “Compiler-managed register file
protection for energy-efficient soft error reduction,” in ASP-DAC, pp.
–623, 2009.
T.J. Dell, “A whitepaper on the benefits of Chippkill-Correct ECC for PC
server main memory,” in IBM Microelectonics division Nov 1997.
S. Kim and A.K. Somani, “An adaptive write error detection technique in
on-chip caches of multi-level cache systems,” in Journal of
microprocessors and microsystems, pp. 561-570, March 1999.
Riaz Naseer, Rashed Zafar Bhatti, and Jeff Draper, “Analysis of Soft
Error Mitigation Techniques for Register Files in IBM Cu-08 90nm
Technology,” in MWSCAS’06, pp. 515-519, 2006.
M. Rebaudengo, M. S. Reorda, and M.Violante, “An Accurate Analysis
of the Effects of Soft Errors in the Instruction and Data Caches of a
Pipelined Microprocessor,” in DATE’03, pp. 602-607, 2003.
Jongeun Lee and A. Shrivastava, “A Compiler -Microarchitecture Hybrid
Approach to Soft Error Reduction for Register Files,” in Computer-Aided
Design of Integrated Circuits and Systems, pp. 1018-1027, 2010.
M. Spica and T.M. Mak, “Do we need anything more than single bit error
correction (ECC)?,“ in Memory Technology, Design and Testing,
Records of the International Workshop on 9-10, pp. 111–116, 2004.
D.C.Bossen, “CMOS Soft Errors and Server Design,” IEEE 2002
Reliability Physics Tutorial Notes, Reliability Fundamentals, pp.
_07.1 – 121_07.6, April 7, 2002.
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.