I am proposing to remove the failover feature from OpenPBS core in favor of using external tools such as Pacemaker and Corosync to achieve high availability.
The design can be found here: https://openpbs.atlassian.net/wiki/spaces/PD/pages/2217705478/Removing+failover+code+from+PBS.
The design page for using Pacemaker and Corosync can be found here:
https://openpbs.atlassian.net/wiki/spaces/PD/pages/2114945027/High+availability+using+pacemaker+and+corosync.