Sun Microsystems Sun Fire X4240 User Manual

Page 75

Advertising
background image

Appendix D

Error Handling

65

Single-bit
DRAM ECC
error

With ECC enabled
in the BIOS Setup,
the CPU detects
and corrects a
single-bit error on
the DIMM interface.

The CPU corrects the error in hardware. No
interrupt or machine check is generated by
the hardware. The polling is triggered every
half-second by SMI timer interrupts and is
done by the BIOS SMI handler.
The BIOS SMI handler starts logging each
detected error and stops logging when the
limit for the same error is reached. The BIOS's
polling can be disabled through a software
interface.

SP SEL

Normal
operation

Single four-bit
DRAM error

With CHIP-KILL
enabled in the BIOS
Setup, the CPU
detects and corrects
for the failure of a
four-bit-wide
DRAM on the
DIMM interface.

The CPU corrects the error in hardware. No
interrupt or machine check is generated by
the hardware. The polling is triggered every
half-second by SMI timer interrupts and is
done by the BIOS SMI handler.
The BIOS SMI handler starts logging each
detected error and stops logging when the
limit for the same error is reached. The BIOS's
polling can be disabled through a software
interface.

SP SEL

Normal
operation

Uncorrectable
DRAM ECC
error

The CPU detects an
uncorrectable
multiple-bit DIMM
error.

The “sync flood” method is used to prevent
the erroneous data from being propagated
across the Hypertransport links. The system
reboots, the BIOS recovers the machine check
register information, maps this information to
the failing DIMM (when CHIPKILL is
disabled) or DIMM pair (when CHIPKILL is
enabled), and logs that information to the SP.
The BIOS will halt the CPU.

SP SEL

Fatal

Unsupported
DIMM
configuration

Unsupported
DIMMs are used, or
supported DIMMs
are loaded
improperly.

The BIOS displays an error message, logs an
error, and halts the system.

DMI Log
SP SEL

Fatal

HyperTranspor
t link failure

CRC or link error
on one of the
Hypertransport
Links.

Sync floods on HyperTransport links, the
machine resets itself, and error information
gets retained through reset.
The BIOS reports, A Hyper Transport
sync flood error occurred on last

boot, press F1 to continue

.

DMI Log
SP SEL

Fatal

TABLE D-1

Hardware Error Handling Summary (Continued)

Error

Description

Handling

Logged (DMI
Log or SP
SEL)

Fatal?

Advertising
This manual is related to the following products: