mpirun (intel oneapi) is not working in pbs-server-2021.1.3. but mpiexec.hydra is working fine.
Please find the error.
[mpiexec@node3] check_exit_codes (…/…/…/…/…/src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:117): unable to run bstrap_proxy on node4.head.cm.ibdc.res.in (pid 379277, exit code 256)
[mpiexec@node3] poll_for_event (…/…/…/…/…/src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:159): check exit codes error
[mpiexec@node3] HYD_dmx_poll_wait_for_proxy_event (…/…/…/…/…/src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:212): poll for event error
[mpiexec@node3] HYD_bstrap_setup (…/…/…/…/…/src/pm/i_hydra/libhydra/bstrap/src/intel/i_hydra_bstrap.c:1061): error waiting for event
[mpiexec@node3] HYD_print_bstrap_setup_error_message (…/…/…/…/…/src/pm/i_hydra/mpiexec/intel/i_mpiexec.c:1027): error setting up the bootstrap proxies
[mpiexec@node3] Possible reasons:
[mpiexec@node3] 1. Host is unavailable. Please check that all hosts are available.
[mpiexec@node3] 2. Cannot launch hydra_bstrap_proxy or it crashed on one of the hosts. Make sure hydra_bstrap_proxy is available on all hosts and it has right permissions.
[mpiexec@node3] 3. Firewall refused connection. Check that enough ports are allowed in the firewall and specify them with the I_MPI_PORT_RANGE
Point 2
After job submission output file not generated, generated after completion of job.
Please share the pbs script . Please check whether you have mentioned all the environment varialbes required for IntelMPI in the script. Please let us know about the test done with mpiexec.hydra.
I have run the script as per suggestion, getting errors.
[ test-na]$ cat Test.out
/var/spool/PBS/mom_priv/jobs/281.brahm-login.SC: line 29: -l: command not found
/var/spool/PBS/mom_priv/jobs/281.brahm-login.SC: line 30: /var/spool/PBS/aux/281.brahm-login: Permission denied
0
[mpiexec@node3] i_np_fn (…/…/…/…/…/src/pm/i_hydra/mpiexec/intel/i_mpiexec_params.h:942): process count should be > 0
[mpiexec@node3] match_arg (…/…/…/…/…/src/pm/i_hydra/libhydra/arg/hydra_arg.c:83): match handler returned error
[mpiexec@node3] HYD_arg_parse_array (…/…/…/…/…/src/pm/i_hydra/libhydra/arg/hydra_arg.c:128): argument matching returned error
[mpiexec@node3] mpiexec_get_parameters (…/…/…/…/…/src/pm/i_hydra/mpiexec/mpiexec_params.c:1359): error parsing input array
[mpiexec@node3] main (…/…/…/…/…/src/pm/i_hydra/mpiexec/mpiexec.c:1783): error parsing parameters.
After backtick mention line still error through, Please find the below errors.
[mpiexec@node3] check_exit_codes (…/…/…/…/…/src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:117): unable to run bstrap_proxy on node4.head.cm.ibdc.res.in (pid 414640, exit code 256)
[mpiexec@node3] poll_for_event (…/…/…/…/…/src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:159): check exit codes error
[mpiexec@node3] HYD_dmx_poll_wait_for_proxy_event (…/…/…/…/…/src/pm/i_hydra/libhydra/demux/hydra_demux_poll.c:212): poll for event error
[mpiexec@node3] HYD_bstrap_setup (…/…/…/…/…/src/pm/i_hydra/libhydra/bstrap/src/intel/i_hydra_bstrap.c:1061): error waiting for event
[mpiexec@node3] HYD_print_bstrap_setup_error_message (…/…/…/…/…/src/pm/i_hydra/mpiexec/intel/i_mpiexec.c:1027): error setting up the bootstrap proxies
[mpiexec@node3] Possible reasons:
[mpiexec@node3] 1. Host is unavailable. Please check that all hosts are available.
[mpiexec@node3] 2. Cannot launch hydra_bstrap_proxy or it crashed on one of the hosts. Make sure hydra_bstrap_proxy is available on all hosts and it has right permissions.
[mpiexec@node3] 3. Firewall refused connection. Check that enough ports are allowed in the firewall and specify them with the I_MPI_PORT_RANGE variable.
[mpiexec@node3] 4. pbs bootstrap cannot launch processes on remote host. You may try using -bootstrap option to select alternative launcher.