Dear PBS community,
First, I am sorry if this question has been asked before.
I am a new user of PBS Pro who has been tasked with managing a cluster.
I have a problem when a job uses more CPUs than it requests through ncpus,
as in the following job script:
#!/bin/bash
#PBS -q longnormal
#PBS -l select=1:ncpus=20
cd $PBS_O_WORKDIR
mpirun -n 40 ./a.out
This job puts a load of about 4000 percent on one node (each node has at most 20 cores), although resources_used.cpupercent
shows only 2000 percent.
This can probably be avoided by using a wrapper around mpirun,
which I saw several years ago as a user at some HPC facilities (I was only a user then, not an admin).
I would like to know whether there is any way to prevent this, either with hooks or with some other approach.
Thank you very much in advance.
Regards,
Fadjar
PS:
I am using PBSPro_13.0.0.151487