Or section 9.35 – HP StorageWorks Scalable File Share User Manual

Page 285

Advertising
background image

Recovering degraded MDS services on systems using EVA4000 storage

9–61

For more information on reviewing SFS20 array information, see Section 6.1.4.2.

If, after further investigation, you decide to replace a disk that is logging URE errors, see Section 8.1.10 for

more information.

TIP:

You can use the

diskinfo

wrapper script for the

hpls_cciss_info

command to report the drive

status on all SFS20 arrays attached to a server. To run the

diskinfo

script, enter the following command

on the server:

# /usr/opt/hpls/diags/bin/diskinfo -v

The command reports disk and controller information for all arrays attached to the server. The

-v

argument

is optional; if you do not use it, the command provides the report in a shorter summary format.

9.35 Recovering degraded MDS services on systems using EVA4000

storage

If an MDS service using an EVA4000 storage array that temporarily failed was mirrored, the RAID status of

the service may be running in

degraded

mode. The

show filesystem

command displays the state of

the service, as shown in the following example:

sfs> show filesystem

Name State Services
------ -------------- ----------------------------------
data started mds8: running(raid: degraded), ost[29-30]: running

In this example, the

mds8

service is shown to be running in

degraded

mode. This means that one mirror

device is operating normally and the other has failed.

Note the following points:

If the disk group itself has failed, rebuild the disk group, create a new virtual disk and present it to the

servers before starting the recovery process. Use the

set lun

command to set the

role

and

used

by

values appropriately.

In most cases where services are degraded, the disk group has not failed. Instead, either a controller

has failed or the Fibre Channel fabric between the controller and the server has failed. You can start

the recovery process described below as soon as both component LUNs are visible (as shown by the

show lun

command), even if the failed controller module has not yet been repaired (the LUN should

be visible through the other controller).

You can recover the service and restore it to correct operation as follows:

1.

If the file system is stopped, start the file system.

2.

Identify the LUNs that the MDS or OST service is based on, by using the

show filesystem

filesystem_name

command as follows. In this example, the

mds8

service is using the mirrored

LUN number 41:

sfs> show filesystem data
.

.

.

MDS Information:


Name LUN Array Controller Files Used Service State Running on
----- --- ----- ---------- -------- ---- ------------- ----------
mds8 41 1 ab 2.3M 20% running south2

OST Information:

Name LUN Array Controller Size(GB) Used Service State Running on

----- --- ----- ---------- -------- ---- ------------- ----------
.

.

.

ost29 34 3 a 290 30% running south4
.

.

.

Advertising