Temperatures, Power – HP Integrated Lights-Out User Manual

Page 80



Monitoring of the fan subsystem covers the sufficient, redundant, and non-redundant fan configurations. Fan failure is rare, but to ensure reliability and uptime, ProLiant servers have redundant fan configurations. In ProLiant servers that support redundant configurations, one or more fans might fail and the remaining fans still provide sufficient cooling to continue operation. iLO 2 increases fan control to continue safe operation of the server in the event of fan failure, maintenance operations, or any event that alters cooling of the server.

In non-redundant configurations, or redundant configurations where multiple fan failures occur, the system might become incapable of providing the necessary cooling to protect the system from damage and to ensure data integrity. In this condition, in addition to the cooling policies, the system might start a graceful shutdown of the operating system and server.

The Fan tab displays the state of the replaceable fans within the server chassis. This data includes the area cooled by each fan and the current fan speed.
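
The fan configurations described above can be pictured with a small model. The following Python sketch is an illustration only, not iLO 2 firmware logic or an iLO 2 interface: the function name `classify_fan_cooling` and the idea of a single `required_fans` count are assumptions made for this example, and real platforms apply server-specific rules.

```python
def classify_fan_cooling(working_fans: int, required_fans: int) -> str:
    """Classify cooling capacity from fan counts (hypothetical model)."""
    if working_fans < 0 or required_fans < 1:
        raise ValueError("working_fans must be >= 0 and required_fans must be >= 1")
    if working_fans > required_fans:
        return "redundant"      # at least one spare fan remains
    if working_fans == required_fans:
        return "non-redundant"  # cooling is sufficient, but there is no spare
    return "insufficient"       # cooling cannot be assured; a graceful shutdown might start


if __name__ == "__main__":
    # Assumed example: a chassis that needs 4 working fans.
    for working in (6, 4, 3):
        print(working, classify_fan_cooling(working, required_fans=4))
```

In this model, a redundant configuration that loses fans moves to non-redundant first, and only further failures make cooling insufficient, which matches the multiple-failure case described above.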

Temperatures

The Temperatures tab displays the location, status, temperature, and threshold settings of temperature sensors in the server chassis. The temperature at each location is monitored to keep it below the caution threshold. If one or more sensors exceed this threshold, iLO 2 implements the recovery policy to prevent damage to server components.

• If the temperature exceeds the caution threshold, the fan speed is increased to maximum.

• If the temperature exceeds the critical threshold, a graceful server shutdown is attempted.

• If the temperature exceeds the fatal threshold, the server is immediately turned off to prevent permanent damage.
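
The escalation through the three thresholds can be sketched as a simple decision function. This is a minimal illustration under stated assumptions, not the iLO 2 implementation: the names `Thresholds` and `thermal_action`, the action labels, and the example threshold values are all hypothetical, and actual thresholds are sensor-specific and shown on the Temperatures tab.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    """Per-sensor limits in degrees Celsius (example values are assumptions)."""
    caution: float
    critical: float
    fatal: float


def thermal_action(reading_c: float, limits: Thresholds) -> str:
    """Return the most severe action whose threshold the reading exceeds."""
    # Check the most severe threshold first so that it takes priority.
    if reading_c > limits.fatal:
        return "power_off_immediately"
    if reading_c > limits.critical:
        return "graceful_shutdown"
    if reading_c > limits.caution:
        return "fans_to_maximum"
    return "none"


if __name__ == "__main__":
    limits = Thresholds(caution=70.0, critical=80.0, fatal=90.0)  # hypothetical values
    for reading in (65.0, 75.0, 85.0, 95.0):
        print(reading, thermal_action(reading, limits))
```

The checks run from most to least severe so that a reading above the fatal threshold is never handled as a mere caution event.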

Monitoring policies differ depending on server requirements. Policies usually include increasing fan speed to maximum cooling, logging the temperature event in the IML, providing visual indication of the event using LED indicators, and starting a graceful shutdown of the operating system to avoid data corruption.

After the excessive temperature condition is corrected, additional policies are implemented, including returning the fan speed to normal, recording the event in the IML, turning off the LED indicators, and, if appropriate, canceling shutdowns in progress.

Power

The VRMs/Power Supplies tab displays the state of each VRM or power supply. A VRM is required for each processor in the system. Each VRM adjusts the power to meet the needs of the processor it supports. A VRM can be replaced if it fails. A failed VRM prevents its processor from being supported.

iLO 2 also monitors power supplies in the system to ensure the longest available uptime of the server and operating system. Power supplies can be affected by brownouts and other electrical conditions, or AC cords can be accidentally unplugged. These conditions result in a loss of redundancy if redundant power supplies are configured, or in a loss of operation if redundant power supplies are not in use.
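
The difference between losing redundancy and losing operation can be illustrated with a short model. The sketch below is an assumption-laden example, not an iLO 2 interface: the function `supply_failure_impact` and its parameters are hypothetical, and it assumes that a fixed number of working supplies (`needed`) is enough to run the server.

```python
def supply_failure_impact(installed: int, failed: int, needed: int = 1) -> str:
    """Describe the effect of power supply failures (hypothetical model)."""
    if needed < 1 or installed < needed or not 0 <= failed <= installed:
        raise ValueError("invalid power supply counts")
    if failed == 0:
        return "normal"
    working = installed - failed
    if working < needed:
        return "loss of operation"   # too few working supplies to run the server
    if working == needed:
        return "loss of redundancy"  # the server runs, but has no spare supply
    return "still redundant"         # a spare remains despite the failure


if __name__ == "__main__":
    print(supply_failure_impact(installed=2, failed=1))  # redundant pair, one cord unplugged
    print(supply_failure_impact(installed=1, failed=1))  # single supply fails
```

With two supplies installed and one needed, a single failure only removes the spare. With one supply installed, the same failure stops the server.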

Additionally, if a power supply failure (hardware failure) is detected or the AC power cord is disconnected, appropriate events are recorded in the IML and LED indicators are used.

iLO 2 monitors power supplies to ensure that they are correctly installed. This information is displayed on the System Information page. Reviewing the System Information page and the IML will assist you in deciding when to repair or replace a power supply, preventing a disruption in service.
