Starting the MoM

Thank you :slight_smile:

I am not using the shared storage (maybe I will try it also in the future, I would like to start a bit easier :wink: )

What is quite interesting the same command which fails during authorisation by PBS MoM:

Works without any problems when I run it later by myself:

The error is: key_parse_private2: missing begin marker

Have you experienced such problems? I know it may be the problem with the access permissions, but I donā€™t see any reasonable reason why it could be like this. As always, thank you very much for your help, in advance :slight_smile:

hi guys,
I have similar situation.regarding the HPC user account.I found if I donā€™t add $usecp lines in the MoM config and share the /data folder through NFS. The job will fail. Do we need create individual user account and configure ssh keyless access between headnode and compute node for all HPC user? Thanks

Hi Joey,

The short answer is yes, you need to create accounts and setup pasword-free ssh access between the submission machines and the execution hosts. Please refer to section 12.8 of the PBS Pro Administratorā€™s Guide located here: http://www.pbsworks.com/SupportGT.aspx?d=PBS-Professional,-Documentation

Thanks,

Mike

And does anyone have any idea why the authorisation may not work in my case as described in my previous post?

Thank you for your help :slight_smile:

You need to check whether password-less ssh access is working between the two hosts in question for that particular user who submitted the job.

Regards,
Subhasis

Hi, mkaro

I entered the same issue, but when I tried to add node as root, it return error like this:

[root@pbs-master linux]# qmgr -c ā€œcreate nodes pbs-slaveā€
qmgr: Error (15066) returned from server

I was trying to install pbs in a virtual environment, the OS is CentOS 7.2, please help me

Thank you

Hello @honeyxdyj,

The first thing you want to check is the qmgr command you used. It should be ā€œcreate nodeā€ rather than ā€œcreate nodesā€. The second thing is to determine whether the host ā€œpbs-slaveā€ resolves on your system. Try using the command ā€œhost pbs-slaveā€ to see if the lookup is happening correctly. If not, youā€™ll need to address the hostname resolution issue on your network.

Mike

Hi, mkaro

The command ā€œhost pbs-slaveā€ returned error info like below:

[linux@pbs-master ~]$ host pbs-slave
Host pbs-slave not found: 3(NXDOMAIN)

For the reason I used the virtual machine created by openstack, I asked my colleague,that because the DNS server on our testing environment is not available, but I have configured the /etc/hosts, so it can ping.

So, do you have any ideas?

Iā€™m assuming you also have a pbs-master host? Entries for both must exist in the /etc/hosts file and you must be able to ping each host from the other. At that point, you should be able to add the pbs-slave node to your complex.

Mike

Hi, mkaro

I wonder know that if I install the PBS Pro with root, is there any ways to let me submit jobs as root?

Thanks

To submit jobs as root, please run the below command:
qmgr -c ā€œset server acl_roots=rootā€

By default , root user job submissions are disabled.

Hi, adarsh

qmgr -c ā€œset server acl_roots=rootā€

When you say the ā€œserverā€ , if it means the hostname? And I got this error below:

Unknown Host.
qmgr: cannot connect to server acl_roots=rootā€

server is not the hostname of the server: qmgr -c ā€œset server acl_roots=rootā€
is the correct command.

Please check whether you can run any qmgr commands as root user ?

  • check your pbs server services are running

qmgr -c ā€œset server acl_roots+=rootā€

Missing the ā€œ+ā€ symble, now it works well.
Thank you very much, adarsh.

Hi @mkaro.

We have an hpc system and users have already been created on the headnode and synced to login node. Do we need to sync to compute node as well before setting up passwordless ssh ?

user accounts should exist on the login nodes and compute nodes.
For passwordless-ssh to work , you would need user accounts to exist on both sides.

Thanks everyone for the help. I finally got PBS to run the jobs. The process I used can be found here