Failover Setup Issues

For Failover:
Yes, a shared file system with a file locking mechanism is required for a PBS Pro failover setup.
Important: Make sure the NFS lock services are up and running.
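
One quick way to see whether file locking works on the shared mount is to try taking a lock on a scratch file there. The snippet below is only a minimal sketch and not part of PBS: the function name can_lock is made up for illustration, and it assumes a Linux/Unix host where Python's fcntl module is available. If the lock services are down, the call may fail or may hang instead of returning, so treat the result as a hint rather than proof.

```python
import fcntl
import os
import tempfile


def can_lock(directory):
    """Return True if an exclusive lock can be taken on a scratch file
    created inside `directory` (hypothetical helper, not a PBS tool)."""
    fd, path = tempfile.mkstemp(prefix="locktest_", dir=directory)
    try:
        try:
            # Non-blocking exclusive POSIX lock; on NFS this relies on
            # the lock services being reachable.
            fcntl.lockf(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except OSError:
            return False
        fcntl.lockf(fd, fcntl.LOCK_UN)
        return True
    finally:
        # Always clean up the scratch file.
        os.close(fd)
        os.remove(path)
```

Run it as the user that owns PBS_HOME and pass the directory on the NFS mount, once from the primary and once from the secondary server.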

Please follow these steps:

Primary Server:

  • install the PBS server with the PBS_HOME directory on the NFS share
  • submit a couple of jobs to make sure the setup works
  • stop the PBS services

Secondary Server:

  • install the PBS server with PBS_HOME on the local disk
  • stop the PBS services
  • edit /etc/pbs.conf and point PBS_HOME to the PBS_HOME directory set up on the NFS share
  • start the PBS services
  • submit a couple of jobs to make sure the setup works
  • stop the PBS services
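
The /etc/pbs.conf edit on the secondary server can be sketched as below. This is only an illustration, assuming the file consists of simple KEY=value lines; the helper name set_conf_value and the path /shared/pbs_home are made up, so substitute the real NFS location of your PBS_HOME.

```python
def set_conf_value(conf_text, key, value):
    """Return conf_text with `key` set to `value`.

    Hypothetical helper: replaces an existing KEY=... line, or appends
    one if the key is not present. Other lines are kept unchanged.
    """
    out = []
    replaced = False
    for line in conf_text.splitlines():
        if line.strip().startswith(key + "="):
            out.append(f"{key}={value}")
            replaced = True
        else:
            out.append(line)
    if not replaced:
        out.append(f"{key}={value}")
    return "\n".join(out) + "\n"


# Example with made-up paths; the function only transforms text and
# does not touch /etc/pbs.conf itself.
sample = "PBS_EXEC=/opt/pbs\nPBS_HOME=/var/spool/pbs\n"
updated = set_conf_value(sample, "PBS_HOME", "/shared/pbs_home")
```

Keep a backup copy of the original /etc/pbs.conf before writing any change back, and make the edit while the PBS services are stopped, as in the steps above.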

Note: When starting the services

  • start the primary PBS server first
  • start the secondary PBS server next

When stopping the services

  • stop the secondary PBS server first
  • stop the primary PBS server next

Caution: Never stop and start the PBS services on the primary and secondary servers in quick succession. Always leave a time gap between stopping and starting the services (and vice versa).
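
If you script this, the ordering and the time gap can be written down as a simple plan. The sketch below is only an illustration: service_plan is a made-up helper, the host names are placeholders, and the 60-second default gap is an assumption, not a value documented for PBS. It only builds the list of steps; how each step is actually run on your hosts is left out.

```python
def service_plan(action, primary, secondary, gap_seconds=60):
    """Build the ordered steps for starting or stopping both servers.

    Hypothetical helper. Start: primary first, then secondary.
    Stop: secondary first, then primary. A wait step sits in between.
    """
    if action == "start":
        first, second = primary, secondary
    elif action == "stop":
        first, second = secondary, primary
    else:
        raise ValueError("action must be 'start' or 'stop'")
    return [
        ("run", first, action),
        ("wait", gap_seconds),  # assumed gap; tune for your site
        ("run", second, action),
    ]
```

For example, service_plan("stop", "pbs-primary", "pbs-secondary") lists the secondary first, then the wait, then the primary.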

It seems the file and folder permissions of PBS_HOME and its subdirectories have been disturbed. Please follow the steps above; the setup should then work without issues.
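
To see which permissions were disturbed, it can help to compare PBS_HOME against a known-good copy, for example a fresh install. The sketch below is one possible way to do that, with made-up helper names; it records mode, owner, and group for everything under a directory and reports the paths that differ. It does not know what the correct PBS permissions are, it only shows differences between two snapshots.

```python
import os
import stat


def snapshot_permissions(root):
    """Map each path under `root` (relative) to (mode, uid, gid).

    Hypothetical helper for comparing two directory trees.
    """
    snapshot = {}
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            full = os.path.join(dirpath, name)
            st = os.lstat(full)  # do not follow symlinks
            rel = os.path.relpath(full, root)
            snapshot[rel] = (stat.S_IMODE(st.st_mode), st.st_uid, st.st_gid)
    return snapshot


def diff_permissions(good, suspect):
    """Return sorted paths present in both snapshots whose entries differ."""
    return sorted(p for p in good if p in suspect and good[p] != suspect[p])
```

Take one snapshot of the working PBS_HOME and one of the problem PBS_HOME, then pass both to diff_permissions to get the list of paths to inspect.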