About vnode creation for each socket

hiromi · January 10, 2020, 12:13am

Dear All,

Can I create a vnode by specifying a physical CPU (socket)?
I'd like to run a different type of job on each CPU socket.
If there is a way to control jobs on a per-socket basis other than creating a vnode, that would be fine too.
Let us know if you have any good ideas.

adarsh · January 10, 2020, 10:07pm

Please check the cgroups discussion topics on this forum.

Implementing cgroups might help in isolating the sockets for the job(s).
Core pinning through your MPI flavour would also help.

It would be helpful to know what kind of jobs or applications you would like to run on the specific socket(s), and the reason behind it.

hiromi · January 14, 2020, 2:21am

Thanks for the advice.

The background of this question is as follows.
The question concerns servers with AMD CPUs (2 sockets) and GPUs.

Multiple GPUs are connected to CPU1 via PCIe Gen4.
No GPU is connected to CPU2.
In that case, we would like to use CPU1 for machine learning jobs and CPU2 for simulation jobs.
The question is whether this is possible.

We have already implemented GPU control using cgroups hooks.

I checked on a 2-CPU execution host.
To control jobs per socket, do I need to write my own hook?
If possible, we'd like to separate the job types by queue, so we thought it would be nice to be able to control this at the vnode level.

Thanks,

adarsh · January 14, 2020, 9:46am

Thank you @hiromi

1/ Yes, that would be the best option.
2/ Otherwise, map the topology of the system in your cgroup hook (not sure about the complexity).
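
As a rough illustration of where "mapping the topology" could start: on Linux, the CPUs belonging to each NUMA node are listed under `/sys/devices/system/node/nodeN/cpulist`. The sketch below only reads that information. It assumes the sysfs layout is present on the host, and the helper names `parse_cpulist` and `numa_topology` are hypothetical and not part of the cgroup hook. Note that a NUMA node is not necessarily the same thing as a socket, so the output should be checked against the actual hardware.

```python
import glob
import os
import re


def parse_cpulist(text):
    """Expand a kernel cpulist string such as "0-3,8-11" into a list of CPU ids."""
    cpus = []
    for part in text.strip().split(","):
        if not part:
            continue  # empty list (node without CPUs)
        if "-" in part:
            lo, hi = part.split("-")
            cpus.extend(range(int(lo), int(hi) + 1))
        else:
            cpus.append(int(part))
    return cpus


def numa_topology(base="/sys/devices/system/node"):
    """Return {numa_node_id: [cpu ids]} read from sysfs (empty dict if absent)."""
    topo = {}
    for path in glob.glob(os.path.join(base, "node[0-9]*", "cpulist")):
        node_dir = os.path.basename(os.path.dirname(path))
        node_id = int(re.match(r"node(\d+)$", node_dir).group(1))
        with open(path) as fh:
            topo[node_id] = parse_cpulist(fh.read())
    return topo


if __name__ == "__main__":
    for node, cpus in sorted(numa_topology().items()):
        print(f"NUMA node {node}: {len(cpus)} CPUs -> {cpus}")
```

Running this on the execution host shows which CPU ids would have to be grouped together if jobs are to be confined to one side of the machine.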

The job type can also be handled in a hook, or it can be part of the Qlist configuration (see "Reserve resources for a user for particular Job within queue when using Node binding - #2 by adarsh"), so that the respective vnode can be targeted.

Thank you,

hiromi · January 15, 2020, 6:13am

Thank you Adarsh,

If possible, we'd like to implement it in a way that does not require creating a new hook.
What does mapping to a cgroup hook mean? Is it possible to define it in pbs_cgroups.py? Or do you add some definition to the OS-side cpuset?

Also, I would like to control this with Qlist, but for that I think a vnode needs to be defined for each socket. Does this mean that we can't do this right now?

Thank you.

hiromi · January 17, 2020, 5:31am

I have additional questions.
In this case, if vnode_per_numa_node is enabled in the cgroups hook configuration, will a vnode be created for each socket?

Thank you.

mkaro · January 17, 2020, 7:52pm

That is correct. You may find additional information about the cgroup hook in chapter 15 of the PBS Pro Admin Guide. https://www.altair.com/pdfs/pbsworks/PBSAdminGuide19.2.3.pdf
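
For illustration only, here is a minimal Python sketch of flipping that setting in the hook's JSON configuration text. It assumes the configuration has already been exported to a file via qmgr as described in the Admin Guide, and that the edited text is imported back afterwards. The helper name `enable_vnode_per_numa_node` and the trimmed sample configuration are hypothetical. It also assumes each socket shows up as exactly one NUMA node, which is worth verifying on AMD systems.

```python
import json


def enable_vnode_per_numa_node(config_text):
    """Return the hook configuration JSON with vnode_per_numa_node set to true."""
    config = json.loads(config_text)
    config["vnode_per_numa_node"] = True
    return json.dumps(config, indent=4)


if __name__ == "__main__":
    # Hypothetical, heavily trimmed configuration; a real file has many more keys.
    sample = '{"vnode_per_numa_node": false}'
    print(enable_vnode_per_numa_node(sample))
```

All other keys in the configuration are passed through unchanged, so the function can be applied to the full exported file contents.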
