I have a couple VM’s running Ubuntu 18.04 with OpenPBS 20.0.1 and Singularity 3.6.4. One is set up as a host server with the other setup as a compute node with the mom daemon enabled. I’m using LDAP for a naming service to eventually help this cluster manage users. This thread seems related https://community.openpbs.org/t/install-pbspro-on-a-single-node/1155/3
I’m looking for some advice on this error:
qmgr -c “create queue workq queue_type=e,started=t,enabled=t”Unknown Host.
qmgr: cannot connect to server queue
Unknown Host.
qmgr: cannot connect to server workq
Unknown Host.
qmgr: cannot connect to server queue_type=e,started=t,enabled=t”
Here’s some additional info:
qmgr -c ‘p s’
Unknown Host.
qmgr: cannot connect to server s’
qstat -Bf
Server: <hostname>
server_state = Active
server_host = <host_address>
scheduling = True
total_jobs = 0
state_count = Transit:0 Queued:0 Held:0 Waiting:0 Running:0 Exiting:0 Begun
:0
log_events = 511
mail_from = adm
query_other_jobs = True
resources_default.ncpus = 1
default_chunk.ncpus = 1
scheduler_iteration = 600
FLicenses = 20000000
resv_enable = True
node_fail_requeue = 310
max_array_size = 10000
pbs_license_min = 0
pbs_license_max = 2147483647
pbs_license_linger_time = 31536000
license_count = Avail_Global:10000000 Avail_Local:10000000 Used:0 High_Use:
0
pbs_version = 20.0.1
eligible_time_enable = False
max_concurrent_provision = 5
max_job_sequence_id = 9999999
pbsnodes -a
<node_name>
Mom = <compute_address>
ntype = PBS
state = state-unknown,down
resources_available.host = <compute_name>
resources_available.vnode = <compute_name>
resources_assigned.accelerator_memory = 0kb
resources_assigned.hbmem = 0kb
resources_assigned.mem = 0kb
resources_assigned.naccelerators = 0
resources_assigned.ncpus = 0
resources_assigned.vmem = 0kb
comment = node down: communication closed
resv_enable = True
sharing = default_shared
last_state_change_time = Fri Jan 8 20:13:24 2021