12 translating slurm and lsf-hpc jobids, Output provided by the bhist command, Using the bhist command (long output) – HP XC System 3.x Software User Manual

Page 110: Table 10-2

Advertising
background image

Table 10-2 Output Provided by the bhist Command

Description

Field

The job ID that LSF-HPC assigned to the job.

JOBID

The user who submitted the job.

USER

The job name assigned by the user.

JOB_NAME

The total waiting time, excluding user suspended time, before the job is dispatched.

PEND

The total user suspended time of a pending job.

PSUSP

The total run time of the job.

RUN

The total user suspended time after the job is dispatched.

USUSP

The total system suspended time after the job is dispatched.

SSUSP

The total unknown time of the job.

UNKWN

The total time that the job has spent in all states.

TOTAL

For detailed information about a finished job, add the -l option to the bhist command, shown
in

Example 10-8

. The -l option specifies that the long format is requested.

Example 10-8 Using the bhist Command (Long Output)

$ bhist -l 24
Job <24>, User <lsfadmin>, Project <default>,
Interactive pseudo-terminal shell mode,
Extsched <SLURM[nodes=4]>, Command </bin/bash>
date and time stamp: Submitted from host <n2>,
to Queue <normal>, CWD <$HOME>,
4 Processors Requested, Requested Resources <type=any>;

date and time stamp: Dispatched to 4 Hosts/Processors
<4*lsfhost.localdomain>;
date and time stamp: slurm_id=22;ncpus=8;slurm_alloc=n[5-8];
date and time stamp: Starting (Pid 4785);

Summary of time in seconds spent in various states by
date and time stamp
PEND PSUSP RUN USUSP SSUSP UNKWN TOTAL
11 0 124 0 0 0 135

10.12 Translating SLURM and LSF-HPC JOBIDs

LSF-HPC and SLURM are independent resource management components of the HP XC system.
They maintain their own job identifiers (JOBIDs). It may be useful to be able to determine which
the SLURM_JOBID environment variable matches an LSF JOBID, and vice versa.

When a job is submitted to LSF-HPC, it is given an LSF JOBID, as in this example:

$ bsub -o %J.out -n 8 sleep 300
Job <99> is submitted to default queue <normal>

The following is the sequence of events when a SLURM JOBID is assigned:

No SLURM_JOBID exists while the job is PENDing in LSF-HPC.

After LSF-HPC determines that the resources are available in SLURM for this job, LSF-HPC
requests an allocation in SLURM.

After the SLURM allocation is established, there is a corresponding SLURM JOBID for the
LSF JOBID.

Use the bjobs command to view the SLURM JOBID:

110

Using LSF-HPC

Advertising