GPU allocations problems

eviatar_yadai_KI · October 21, 2021, 10:14am

Hi all,

I have started working with GPU allocations in OpenPBS. Resources are configured with cgroups, and my server has 4 GPUs.

The problem is that PBS allocates only GPU 1 and GPU 3. It correctly prevents more than 4 GPU processes from running in parallel, but the GPUs it assigns (through the environment variable ‘CUDA_VISIBLE_DEVICES’) are wrong.

For example, if I run 5 jobs with ngpus=1, PBS allocates GPU 1, GPU 3, GPU 1, GPU 3, and then waits for one of the jobs to end before it starts the fifth job on the GPU that became available.

By the way, if I run with ngpus=2 or ngpus=4, the GPUs are allocated correctly.
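
To make the overlap easier to spot, here is a minimal Python sketch, not an OpenPBS tool. It assumes you collect the ‘CUDA_VISIBLE_DEVICES’ value each job sees (for example by printing it at the start of the job script). The helper names parse_visible_devices and find_conflicts are hypothetical, introduced only for this illustration.

```python
import os
from collections import defaultdict


def parse_visible_devices(value):
    """Split a CUDA_VISIBLE_DEVICES string into a list of device identifiers."""
    if not value:
        return []
    return [d.strip() for d in value.split(",") if d.strip()]


def find_conflicts(job_devices):
    """Return {device: [job ids]} for every device claimed by more than one job.

    job_devices maps a job id to the CUDA_VISIBLE_DEVICES string that job saw
    (values collected by hand; this is an assumption of the sketch).
    """
    owners = defaultdict(list)
    for job_id, value in job_devices.items():
        for device in parse_visible_devices(value):
            owners[device].append(job_id)
    return {dev: jobs for dev, jobs in owners.items() if len(jobs) > 1}


if __name__ == "__main__":
    # Run inside a job to record what that job was actually given.
    print("PBS_JOBID =", os.environ.get("PBS_JOBID", "<unset>"))
    print("CUDA_VISIBLE_DEVICES =",
          parse_visible_devices(os.environ.get("CUDA_VISIBLE_DEVICES")))
```

Feeding it the pattern described above, with four concurrent jobs seeing 1, 3, 1, 3, reports GPU 1 and GPU 3 as each shared by two jobs, while a correct allocation returns an empty result.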

Thank you in advance
