One of the two GPU vnodes is offline. Both vnodes are on one physical node. Below is the error read from the MOM log:
10/14/2024 03:30:02;0080;pbs_python;Hook;pbs_python;['Traceback (most recent call last):', ' File "<embedded code object>", line 6505, in main', ' File "<embedded code object>", line 1041, in invoke_handler', ' File "<embedded code object>", line 1283, in _exechost_periodic_handler', ' File "<embedded code object>", line 2868, in bring_node_online', "TypeError: argument of type 'NoneType' is not iterable"]
10/14/2024 03:30:02;0001;pbs_python;Hook;pbs_python;Unexpected error in pbs_cgroups handling exechost_periodic event: TypeError ("argument of type 'NoneType' is not iterable",)
10/14/2024 03:30:03;0100;pbs_mom;Hook;pbs_cgroups;exechost_periodic request rejected by 'pbs_cgroups'
10/14/2024 03:30:03;0100;pbs_mom;Hook;pbs_cgroups;Unexpected error in pbs_cgroups handling exechost_periodic event: TypeError ("argument of type 'NoneType' is not iterable",)
What could that mean?