Troubleshooting raid controller modules, Conditions, Invalid storage array – Dell POWERVAULT MD3620I User Manual

Page 283: Ecc errors

Advertising
background image

Troubleshooting: Your Array

281

Troubleshooting RAID Controller Modules

Conditions

Certain events can cause a RAID controller module to fail and/or shut down.

Unrecoverable ECC memory or PCI errors, or critical physical conditions can

cause lockdown. If your RAID storage array is configured for redundant access

and cache mirroring, the surviving controller can normally recover without

data loss or shutdown.
Typical hard controller failures are detailed in the following sections.

Invalid Storage Array

The RAID controller module is supported only in a Dell-supported storage

array. Upon installation in the storage array, the controller performs a set of

validation checks. The array status LED is lit with a steady amber color while

the RAID controller module completes these initial tests and the controllers

are booted successfully. If the RAID controller module detects a non-Dell

supported storage array, the controller aborts startup. The RAID controller

module does not generate any events to alert you in the event of an invalid

array, but the array status LED is lit with a flashing amber color to indicate a

fault state.
For full details on the LEDs and their interpretation, see "Back-Panel Features

and Indicators" on page 27.

ECC Errors

RAID controller firmware can detect ECC errors and can recover from a

single-bit ECC error irrespective of the RAID controller module

configuration. A storage array with redundant controllers can recover from

multi-bit ECC errors as well because the peer RAID controller module can

take over, if necessary.
The RAID controller module failsover if it experiences up to 10 single-bit

errors, or up to 3 multi-bit errors.

book.book Page 281 Tuesday, June 18, 2013 3:09 PM

Advertising
This manual is related to the following products: