3 getting host load information, 4 examining system queues, 5 getting information about the lsf partition – HP XC System 4.x Software User Manual

Page 96: 11 getting information about jobs, Examining, Getting

Advertising
background image

10.10.3 Getting Host Load Information

The LSF lsload command displays load information for LSF execution hosts.

$ lsload
HOST_NAME status r15s r1m r15m ut pg ls it tmp swp mem
lsfhost.loc ok - - - - - 4 - - - -

In the previous example output, the LSF execution host (lsfhost.localdomain) is listed
under the HOST_NAME column. The status is listed as ok, indicating that it can accept remote
jobs. The ls column shows the number of current login users on this host.

See the OUTPUT section of the lsload manpage for further information about the output of this
example. In addition, see the Platform LSF documentation and the lsload(1) manpage for more
information about the features of this command.

For individual compute node load information, see the discussion on metrics in shownode(1).

10.10.4 Examining System Queues

All jobs on the HP XC system that are submitted to LSF are placed into an LSF job queue. HP
recommends that you examine the status and availability of LSF system queues before launching
a job so that you can select the most appropriate queue for your job.

You can easily verify the status, limits, and configurations of LSF queues with the bqueues
command. This command is fully described in Platform LSF documentation and manpages.

See bsub(1) for more information on submitting jobs to specific queues.

For more information on the bqueues command, see bqueues(1).

10.10.5 Getting Information About the lsf Partition

Information about the SLURM lsf compute node partition can be viewed with the SLURM
sinfo

command. A partition is one or more compute nodes that have been grouped together.

A SLURM lsf partition is created when the HP XC system is installed. This partition contains
the resources that will be managed by LSF and available for jobs submitted to LSF.

The sinfo command reports the state of the lsf partition and all other partitions on the system.
The sinfo command displays a summary of available partition and node information such as
partition names, nodes/partition, and cores/node). It has a wide variety of filtering, sorting, and
formatting options.

The following example shows the use of the sinfo command to obtain lsf partition information:

$ sinfo -p lsf
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
lsf up infinite 128 idle n[1-128]

Use the following command to obtain more information on the nodes in the lsf partition:

$ sinfo -p lsf -lNe
date and time
NODELIST NODES PARTITION STATE CPUS S:C:T MEMORY TMP_DISK WEIGHT FEATURES REASON
n[1-128] 3 lsf idle 2 2:1:1 3892 0 1 (null) none

See

“Getting System Information with the sinfo Command”

and the sinfo(1) manpage and for

further information about using the sinfo command.

10.11 Getting Information About Jobs

There are several ways you can get information about a specific job after it has been submitted
to LSF integrated with SLURM. This section briefly describes some of the commands that are
available under LSF integrated with SLURM to gather information about a job. This section is
only intended to give you an idea of the commonly used commands and to describe any
differences there may be in their operation in the HP XC environment, not as a complete reference

96

Using LSF

Advertising