Environmental, Initialization, Message passing – Compaq RAID ARRAY 3000 EK-SMCPO-UG. C01 User Manual

Page 55: Failover

Advertising
background image

RAID Array Controller 2-21

Compaq Confidential – Need to Know Required

Writer: Bob Young Project: RAID Array 3000 Pedestal Storage Subsystem Hardware User’s Guide Comments:

Part Number: EK-SMCPO-UG. C01 File Name: c-ch2 RAID Array Controller.doc Last Saved On: 12/4/00 1:51 PM

Initialization

During initialization, the firmware in the RAID 3000 verifies that both
controllers have consistent configurations including identical memory cache
and system parameters. If the controller setups are incompatible, the set is not
bound and each controller operates in stand-alone mode.

Message Passing

Information is shared between the two controllers by a collection of messages
passed through the backplane connectors. The messages provide configuration
data as well as a heartbeat which is transmitted by each controller every
500 ms. If a controller does not receive a heartbeat within one second, it
assumes the peer controller has become inoperable and begins failing over.

If the controllers cannot exchange messages due to communication problems
over the backplane, they will break the connection and each controller will
switch to a stand-alone mode.

Failover

Failover describes the process of transferring data from a failed controller to a
survivor and completing any active tasks. When one controller begins the fail-
over process, it sends a reset to the other controller, which prevents the failing
unit from processing any more information and enables any host ports that are
passive. It then downloads the failed controller’s cache to its unused portion of
cache and begins acting upon that data.

While downloading the data, the controller responds to I/O by disconnecting
(if allowed) and waiting approximately three seconds before reconnecting and
presenting a Busy status. The delay is to prevent host operating systems from
seeing too many errors and fencing off the controller.

Environmental

The controller incorporates a set of on board sensors to detect abnormal
operating conditions that may affect data safety.

Advertising