Other folks with a related problem usually seem to have some value following "Invalid local hostname: ", even if it’s incorrect, but my instance of the service doesn’t seem to be able to read anything.
Here are the contents of my /etc/pbs.conf file, /etc/hosts file, and the outputs from hostname and hostname -f:
The value of PBS_LEAF_NAME should be the hostname of the interface over which you want PBS Pro to communicate, not the domain name. Unless you have multiple NICs, you should not need to set this value at all. Please try removing PBS_LEAF_NAME from /etc/pbs.conf and try starting PBS Pro.
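For reference, here is a minimal sketch of how to see what is currently set, assuming /etc/pbs.conf uses the usual KEY=value layout. The helper name pbs_conf_value is made up for illustration and is not part of PBS Pro. It prints the value of one key so you can compare it with the output of hostname.

```shell
# Hypothetical helper (not part of PBS Pro): print the value of a key from a
# KEY=value style config file. Prints nothing if the key is absent.
pbs_conf_value() {
    # $1 = key name, $2 = config file (defaults to /etc/pbs.conf)
    sed -n "s/^$1=//p" "${2:-/etc/pbs.conf}"
}

# Example: show PBS_LEAF_NAME (if set) next to the short hostname.
# pbs_conf_value PBS_LEAF_NAME
# hostname
```

If the first command prints nothing, PBS_LEAF_NAME is not set and there is nothing to remove.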
I’m fairly sure this issue is unrelated to the one originally presented in the thread; however, the system setup remains the same. Again, any help would be appreciated, and let me know if you need more information.
The output from these commands should help to diagnose the problem. It’s very unusual that the scheduler failed to start. After you stop PBS, please make sure there are no lingering processes (e.g. ps -ef | grep pbs).
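A small sketch of that lingering-process check follows. The function name filter_pbs_procs is invented for illustration. The bracketed first letter in the pattern keeps grep from matching its own command line, so an empty result means nothing is left running.

```shell
# Hypothetical helper: reduce `ps -ef` style output on stdin to lines that
# mention a PBS daemon. The [p] bracket stops grep from matching itself.
filter_pbs_procs() {
    grep -E '[p]bs_(server|sched|mom|comm)'
}

# Usage after stopping PBS:
# ps -ef | filter_pbs_procs || echo "no PBS processes left"
```

Only the four daemon names that appear in the status output are matched here; widen the pattern if you also want to see helper processes.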
Stopping was successful and no related processes appeared to be lingering (the only grep result was the grep itself); however, startup failed. Could this be an issue with permissions?
and provide the output. PBS performs some sanity checks when it starts to make sure certain files and directories are present and have the expected permissions. Is /var/spool/pbs/sched_priv a symbolic link?
I don’t see anything unusual in the output you provided. The one thing that puzzles me is the log message:
pbs_sched: Operation not permitted (1) in chk_file_sec, Security violation "/var/spool/pbs/sched_priv" resolves to "/var/spool/pbs"
I would have expected the two paths to be identical. I don’t believe xfs is a problem, though I tend to use ext4. Are there hidden attributes in XFS that could be affecting permissions?
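One way to test whether a symlink is involved is to compare the path with its fully resolved form. This is only a sketch of that comparison, not the check PBS itself runs in chk_file_sec. The function name resolves_to_self is invented, and readlink -f assumes GNU coreutils.

```shell
# Hypothetical helper: succeed only if the given path resolves to itself,
# i.e. no component of it is a symbolic link. Assumes GNU `readlink -f`.
resolves_to_self() {
    resolved=$(readlink -f -- "$1") || return 2
    if [ "$resolved" = "$1" ]; then
        return 0
    fi
    # Report the mismatch in the same style as the log message.
    echo "\"$1\" resolves to \"$resolved\""
    return 1
}

# Usage (pass the path without a trailing slash):
# resolves_to_self /var/spool/pbs/sched_priv
# ls -ld /var/spool/pbs /var/spool/pbs/sched_priv
```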
You might try moving /var/spool/pbs (PBS_HOME in /etc/pbs.conf) to another filesystem and updating /etc/pbs.conf to point to the new location.
Interestingly enough, a fresh attempt at initialization of the service appeared successful despite making no recent alterations. Restarting the service appeared equally successful. However, unless I am misunderstanding something, none of the functions of the service appear operational:
Our firewall is definitely disabled, but could this point to some other disruption of communication between the command line and the pbs server? Also, could this in any way be related to the previous issue with the scheduler?
$ telnet -4 pbs-server 15001
Trying 192.168.1.2...
Connected to pbs-server.
Escape character is '^]'.
+2+15+15056+0+1Connection closed by foreign host.
When I try connecting to an unused port I see this:
Hi, everyone.
I am installing PBS Pro on Ubuntu 18.04 on a single node.
I came across this error when I tried to start the PBS server.
$ sudo /etc/init.d/pbs start
Starting PBS
/opt/pbs/sbin/pbs_comm ready (pid=26546), Proxy Name:localhost:17001, Threads:4
PBS comm
PBS mom
PBS sched
/opt/pbs/sbin/pbs_ds_systemd: 43: [: xdegraded: unexpected operator
Connecting to PBS dataservice…connected to PBS dataservice@localhost
PBS server
And when I type:
$ /etc/init.d/pbs status
pbs_server is not running
pbs_mom is pid 26575
pbs_sched is not running
pbs_comm is not running
Could you suggest what might be the problem?
Thank you.
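A note on the "[: xdegraded: unexpected operator" line: it has the shape of the message that dash (the default /bin/sh on Ubuntu) prints when a script uses == inside [ ]. Whether line 43 of pbs_ds_systemd actually does that is an assumption, since the script is not shown here, and it would explain only that warning, not necessarily why the daemons are not running. A minimal sketch of the portable form of such a comparison, with made-up names:

```shell
# Hypothetical illustration; `state` and is_degraded are invented names and
# this is not the actual content of pbs_ds_systemd.
is_degraded() {
    state=$1
    # POSIX test uses a single "=" for string equality. Writing "==" here
    # works in bash but fails under dash with "unexpected operator".
    [ "x$state" = "xdegraded" ]
}

# Usage:
# is_degraded "degraded" && echo "state is degraded"
```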