2 monitoring device critical errors, 6 oosd tests, Commands execution – Artesyn ViewCheck on ATCA-7470/7475 Installation and Use (May 2014) User Manual


Commands Execution

ViewCheck on ATCA-7470/7475 Installation and Use (6806800S49C)


5.5.2 Monitoring Device Critical Errors

Under Linux, device drivers log abnormal behavior and potential errors occurring in
hardware devices at the KERN_ERR or KERN_CRIT level. These notifications are
treated as potential errors because they could develop into latent faults in the live system.
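As an illustration of the two log levels named above, the following minimal sketch classifies kernel log records by level. It assumes the records come from the Linux /dev/kmsg interface, where each record starts with a numeric priority whose low three bits are the log level (2 for KERN_CRIT, 3 for KERN_ERR). The function names are hypothetical, and this is not how ViewCheck itself is implemented.

```python
# Kernel log levels as defined by Linux printk.
KERN_CRIT = 2
KERN_ERR = 3


def parse_kmsg_record(record):
    """Split a '/dev/kmsg' style record into (level, message).

    Assumed record layout: '<priority>,<sequence>,<timestamp>,<flags>;<message>'.
    The log level is the low three bits of the priority value.
    """
    header, _, message = record.partition(";")
    priority = int(header.split(",")[0])
    return priority & 7, message.strip()


def is_device_error(record):
    """Return True when the record was logged at KERN_CRIT or KERN_ERR."""
    level, _ = parse_kmsg_record(record)
    return level in (KERN_CRIT, KERN_ERR)
```

Records logged at other levels, such as informational messages, are ignored by this filter.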

As part of monitoring device critical errors, all such kernel critical and kernel error
notifications have been extracted from the PNE Kernel () driver sources and stored in a
database.

The In Service Monitoring Module of ViewCheck watches for the occurrence of these
notifications and, on detection, sends a notification to XML.
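The detect-and-notify flow described above can be sketched roughly as follows. Everything here is an assumption made for illustration: the database entries, the EXAMPLE-style identifiers, the XML element names, and the function names are hypothetical and do not reflect the actual ViewCheck database, ERROR ID format, or XML schema.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical database: placeholder IDs paired with message patterns.
# Real ViewCheck entries and ERROR IDs are defined elsewhere in the manual.
ERROR_DATABASE = [
    ("EXAMPLE-0001", re.compile(r"I/O error, dev (\w+)")),
    ("EXAMPLE-0002", re.compile(r"NETDEV WATCHDOG: (\w+)")),
]


def match_error(message):
    """Return (error_id, device) for the first matching pattern, else None."""
    for error_id, pattern in ERROR_DATABASE:
        found = pattern.search(message)
        if found:
            return error_id, found.group(1)
    return None


def build_notification(message):
    """Build an XML notification string for a known message, else None."""
    hit = match_error(message)
    if hit is None:
        return None
    error_id, device = hit
    root = ET.Element("notification")  # element names are illustrative only
    ET.SubElement(root, "errorId").text = error_id
    ET.SubElement(root, "device").text = device
    ET.SubElement(root, "message").text = message
    return ET.tostring(root, encoding="unicode")
```

A message that matches no database entry yields no notification, which mirrors the idea that only known critical and error messages are reported.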

Device errors are captured and uniquely identified by an ERROR ID. For the definition of
ERROR ID, refer to Error ID on page 19.

5.6 OOSD Tests

OOSD tests are used to monitor and manage the performance of the hardware components of
blades. You can execute these tests only when the blades are offline, that is, when they are
not providing any service.

Table 5-28 Monitors (continued)

| Monitor Description | Monitor ID | Valid Device Instances | Remarks |
|---|---|---|---|
| HDD Health Status | 1010 | sda1 to sda8, sdb1 to sdb8 | Monitors the health status of the sda1 to sda8 partitions on the HDD and reports it. |
| Network Errors | 1020 | base1(70), base2(71), fabric 1(73), fabric 2(74), rtm1(147) | Monitors the various error counters for each of the network device instances and reports when an error counter exceeds the rate of change. |
| Network Counters | 1021 | base1(70), base2(71), fabric 1(73), fabric 2(74), rtm1(147) | Monitors the various counters for each of the network device instances. |
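The rate-of-change check described for the Network Errors monitor can be illustrated with a short sketch. It assumes two samples of per-interface error counters taken a known number of seconds apart and a single maximum allowed rate. The counter names follow the Linux interface statistics (rx_errors, tx_errors), while the function names, sampling interval, and threshold are hypothetical and not taken from ViewCheck.

```python
def counter_rates(previous, current, interval_seconds):
    """Per-second rate of change for each counter present in both samples."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    return {
        name: (current[name] - previous[name]) / interval_seconds
        for name in current
        if name in previous
    }


def counters_over_limit(previous, current, interval_seconds, max_rate):
    """Names of counters whose rate of change exceeds max_rate, sorted."""
    rates = counter_rates(previous, current, interval_seconds)
    return sorted(name for name, rate in rates.items() if rate > max_rate)


# Example: two samples for one interface, taken 10 seconds apart.
earlier = {"rx_errors": 10, "tx_errors": 5}
later = {"rx_errors": 40, "tx_errors": 6}
flagged = counters_over_limit(earlier, later, interval_seconds=10, max_rate=1.0)
```

In this example rx_errors rises by 3 per second and is flagged, while tx_errors rises by 0.1 per second and is not.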
