2 running performance health tests – HP XC System 4.x Software User Manual

Page 70

Advertising
background image

7.2 Running Performance Health Tests

You can run the ovp command to generate reports on the performance health of the nodes. Use
the following format to run a specific performance health test:

ovp

[options] [-verify=perf_health/test]

Where:

options

Specify additional command line options for the test. The ovp --help
perf_health

command lists the command line options for each test.

The following options apply to all the tests:

NOTE:

Use the --opts= option to pass this option.

The --nnodes=n option runs the test on n compute nodes.

The --nodelist=nodelist option specifies the compute nodes to use.

NOTE:

The --nodelist=nodelist option is particularly useful for

determining problematic nodes.

If you use this option and the --nnodes=n option, the --nnodes=n option is
ignored.

The --queue LSF_queue option specifies the LSF queue for the performance
health tests.

test

Indicates the test to perform. The following tests are available:
cpu

Tests CPU core performance using the Linpack
benchmark.

cpu_usage

Tests CPU core usage. All CPU cores should be
idle during the test. This test reports a node if it
is using more than 10% (by default) of its CPU
cores.

The head node is excluded from this test.

memory

Uses the streams benchmark to test memory
performance.

memory_usage

Tests memory usage. This test reports a node that
uses more than 25 percent (by default) of its
memory.

network_stress

Tests network performance. Check network
performance under stress using the Pallas
benchmark's Alltoall, Allgather, and Allreduce
tests. These tests should be performed on a large
number of nodes for the most accurate results.

The default value for the number of nodes is 4,
which is the minimum value that should be used.

The --all_group option allows you to select
the node grouping size.

network_bidirectional

Tests network performance between pairs of
nodes using the Pallas benchmark's Exchange
test.

network_unidirectional

Tests network performance between pairs of
nodes using the HP MPI ping_pong_ring test.

70

Monitoring Node Activity

Advertising