1 job allocation information for a running job – HP XC System 3.x Software User Manual


After LSF-HPC integrated with SLURM allocates nodes for a job, it attaches allocation information to the
job.

The bjobs -l command provides job allocation information for a running job. The bhist -l command
provides job allocation information for a finished job. For details about using these commands, see the
LSF manpages.

A job allocation information string resembles the following:

slurm_id=slurm_jobid;ncpus=slurm_nprocs;slurm_alloc=node_list

This allocation string has the following values:

slurm_id
    The value of the SLURM_JOBID environment variable. This is the SLURM allocation ID, which associates the LSF-HPC job with the SLURM-allocated resources.

ncpus
    The value of the SLURM_NPROCS environment variable. This is the actual number of allocated cores. Under node-level allocation scheduling, this number may be larger than the number the job requests.

slurm_alloc
    A comma-separated list of allocated nodes.
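Because the allocation string is a list of key=value pairs separated by semicolons, a script can split it into its fields. The following Python sketch is illustrative only and is not part of LSF-HPC or SLURM. The function name parse_allocation is hypothetical, and the sample values are the ones reported in Example 10-1.

```python
def parse_allocation(info):
    """Split 'key=value;key=value;...' into a dictionary of fields."""
    fields = {}
    for part in info.strip().split(";"):
        if not part:
            continue  # skip the empty piece left by a trailing ';'
        key, _, value = part.partition("=")
        fields[key.strip()] = value.strip()
    # ncpus is described as a core count, so expose it as an integer.
    if "ncpus" in fields:
        fields["ncpus"] = int(fields["ncpus"])
    return fields


alloc = parse_allocation("slurm_id=22;ncpus=8;slurm_alloc=n[5-8];")
print(alloc)
# {'slurm_id': '22', 'ncpus': 8, 'slurm_alloc': 'n[5-8]'}
```

The slurm_id value is kept as a string because it is used only as an identifier.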

LSF-HPC integrated with SLURM sets the SLURM_JOBID and SLURM_NPROCS environment variables
when it starts a job.
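A program started by the job can read these two variables from its environment. The following Python sketch assumes only the variable names given above. The function name current_allocation is hypothetical, and the function returns None for any variable that is not set, which may be the case when the program runs outside a job.

```python
import os


def current_allocation(env=None):
    """Return (job_id, nprocs) read from the environment.

    job_id is a string and nprocs is an integer; either is None if the
    corresponding variable is not set.
    """
    if env is None:
        env = os.environ
    job_id = env.get("SLURM_JOBID")
    nprocs = env.get("SLURM_NPROCS")
    return job_id, (int(nprocs) if nprocs is not None else None)


job_id, nprocs = current_allocation()
if job_id is None:
    print("SLURM_JOBID is not set")
else:
    print("SLURM allocation", job_id, "with", nprocs, "cores")
```

Passing a dictionary as the env argument makes it possible to try the function without submitting a job.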

Example 10-1 illustrates how to use the bjobs -l command to obtain job allocation information about a running job:

Example 10-1 Job Allocation Information for a Running Job

$ bjobs -l 24

Job <24>, User <lsfadmin>, Project <default>,
     Status <RUN>, Queue <normal>,
     Interactive pseudo-terminal shell mode,
     Extsched <SLURM[nodes=4]>, Command </bin/bash>
date and time stamp: Submitted from host <n2>, CWD <$HOME>,
     4 Processors Requested, Requested Resources <type=any>;
date and time stamp: Started on 4 Hosts/Processors <4*lsfhost.localdomain>;
date and time stamp: slurm_id=22;ncpus=8;slurm_alloc=n[5-8];

SCHEDULING PARAMETERS:
          r15s r1m  r15m ut   pg   io   ls   it   tmp  swp  mem
loadSched -    -    -    -    -    -    -    -    -    -    -
loadStop  -    -    -    -    -    -    -    -    -    -    -

EXTERNAL MESSAGES:
MSG_ID  FROM      POST_TIME            MESSAGE         ATTACHMENT
0       -         -                    -               -
1       lsfadmin  date and time stamp  SLURM[nodes=4]  N

In particular, note the node and job allocation information provided in the above output:

date and time stamp: Started on 4 Hosts/Processors <4*lsfhost.localdomain>;

date and time stamp: slurm_id=22;ncpus=8;slurm_alloc=n[5-8];
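A script that captures the bjobs -l output can locate the allocation line and list the individual nodes. The following Python sketch is illustrative only. The function names find_allocation and expand_nodes are hypothetical, and the expansion assumes that a bracketed entry such as n[5-8] denotes the consecutive nodes n5 through n8, as the four-node request in the example suggests. It does not preserve zero-padded node numbers.

```python
import re

# Matches the allocation string wherever it appears in the command output.
ALLOC_RE = re.compile(r"slurm_id=(\d+);ncpus=(\d+);slurm_alloc=([^;\s]+)")


def find_allocation(output):
    """Return the allocation fields found in bjobs -l output, or None."""
    match = ALLOC_RE.search(output)
    if match is None:
        return None
    return {
        "slurm_id": match.group(1),
        "ncpus": int(match.group(2)),
        "slurm_alloc": match.group(3),
    }


def expand_nodes(node_list):
    """Expand forms such as 'n[5-8]', 'n[1,3-4]', and 'n5,n7' (assumed syntax)."""
    nodes = []
    # Take each comma-separated entry, keeping any bracketed part attached.
    for item in re.findall(r"[^,\[]+(?:\[[^\]]*\])?", node_list):
        match = re.fullmatch(r"([^\[]+)\[([^\]]+)\]", item)
        if match is None:
            nodes.append(item)  # plain node name such as 'n5'
            continue
        prefix, ranges = match.groups()
        for piece in ranges.split(","):
            low, _, high = piece.partition("-")
            high = high or low  # a single number is a range of one
            nodes.extend(f"{prefix}{i}" for i in range(int(low), int(high) + 1))
    return nodes


line = "date and time stamp: slurm_id=22;ncpus=8;slurm_alloc=n[5-8];"
info = find_allocation(line)
print(expand_nodes(info["slurm_alloc"]))
# ['n5', 'n6', 'n7', 'n8']
```

In this example, ncpus=8 across four nodes illustrates the case noted earlier, in which node-level allocation provides more cores than the four processors requested.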

Example 10-2 illustrates how to use the bhist -l command to obtain job allocation information about a job that has run:

10.11 Getting Information About Jobs
