Dear all,
while preparing for an overlay upgrade from OpenPBS version 20.0.1 to 22.05.11 I ran into an issue while starting the pbs
service on the PBS server.
This is a blank dev machine, so no job submissions were hurt during this test. Nontheless, I try to replicate a running setup which I want to upgrade the same path. Here’s the output:
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Log;Log opened
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;pbs_version=22.05.11
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;pbs_build=mach=N/A:security=N/A:configure_args=N/A
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;hostname=pbshead1.dev.tld.local;pbs_leaf_name=N/A;pbs_mom_node_name=N/A
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;ipv4 interface lo: localhost
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;ipv4 interface ens18: pbshead1.dev.tld.local
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;ipv6 interface lo: ip6-loopback
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;ipv6 interface ens18: pbshead1.dev.tld.local
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;ipv6 interface ens18: pbshead1.dev.tld.local
11/28/2022 15:47:40;0002;Server@pbshead1;Svr;Server@pbshead1;ipv6 interface ens18: pbshead1.dev.tld.local
11/28/2022 15:47:40;0006;Server@pbshead1;Fil;Server@pbshead1;Version 22.05.11, started, initialization type = 1
11/28/2022 15:47:41;0002;Server@pbshead1;Svr;Server@pbshead1;pbs_status_db exit code 1
11/28/2022 15:47:41;0002;Server@pbshead1;Svr;Server@pbshead1;Starting PBS dataservice
11/28/2022 15:47:43;0002;Server@pbshead1;Svr;Server@pbshead1;Prepare of statement insert_job failed: ERROR: column "ji_jid" of relation "job" does not exist
LINE 1: ...at,ji_quetime,ji_rteretry,ji_fromsock,ji_fromaddr,ji_jid,ji_...
^ 42703
11/28/2022 15:47:44;0002;Server@pbshead1;Svr;Server@pbshead1;Starting PBS dataservice
11/28/2022 15:47:46;0002;Server@pbshead1;Svr;Server@pbshead1;pbs_status_db exit code 0
11/28/2022 15:47:46;0002;Server@pbshead1;Svr;Server@pbshead1;Prepare of statement insert_job failed: ERROR: column "ji_jid" of relation "job" does not exist
LINE 1: ...at,ji_quetime,ji_rteretry,ji_fromsock,ji_fromaddr,ji_jid,ji_...
^ 42703
(continues with infinite restart attempts)
The server (headnode) is an Ubuntu 18.04 and does not run an execution node (MoM) itself. Two MoMs are connected to the headnode. OpenPBS versions are self-compiled.
I followed the PBS Installation Guide up to chapter 6.5.17.2, that’s where it fails. I also found a related issue report in the forum, but although the pull requests are merged, the issue seems to be the same.
Postgres version:
root@pbshead1:~# /usr/lib/postgresql/10/bin/postgres -V
postgres (PostgreSQL) 10.22 (Ubuntu 10.22-0ubuntu0.18.04.1)
I can provide more logs and information if wanted. Any help appreciated!