I use NFS to configurate PBS_HOME for pbs primary server and secondary server, when primary server takes over again, secondary server seems not to release the file lock, so the primary server can not work normally. After I stop secondary server, it can take over successfully.
If pacemaker + corosync can instead of primary server and secondary server totally?(Install two pbs server)
With pacemaker and corosync based failover, there is no file locking mechanism.
The PBS_HOME is attached to the active PBS Server.
In case of , PBS Pro legacy failover (Primary /Secondary) , the PBS_HOME is banking on file locking mechanism, the active server that has control on the file lock is controlling the cluster. The other server is just active and checking the heartbeat of the other server. If there are any issues with file locking, then both servers think the control the services ( split brain) and might corrupt the datastore.