Hi,
I wanted to check whether there is any way to find out who killed a job. Right now the job comment looks like the one below, and we do not know who issued the qdel command.
comment = Not Running: Insufficient amount of resource: mem (R: 11111111mb A: 9370557mb T: 11399679mb) and terminated
Thanks.
You can use the tracejob and pbs_dtj commands to find out who deleted the job, or you can check the server logs. See the tracejob output below: the job was deleted by pbsuser, so this message appears in the tracejob output:
**07/01/2019 21:57:18 S Job to be deleted at request of pbsuser@pbsserver.pbspro.org**
[pbsuser@pbsserver ~]$ qsub -l select=1:ncpus=100 -- /bin/sleep 1000
11272.pbsserver
[pbsuser@pbsserver ~]$ qstat -answ1
pbsserver:
                                                                                                   Req'd  Req'd   Elap
Job ID                         Username        Queue           Jobname         SessID   NDS  TSK   Memory Time  S Time
------------------------------ --------------- --------------- --------------- -------- ---- ----- ------ ----- - -----
11272.pbsserver                pbsuser         workq           STDIN           --       1    100   --     --    Q --    --
   Can Never Run: Insufficient amount of resource: ncpus (R: 100 A: 4 T: 4)
[pbsuser@pbsserver ~]$ qdel 11272
[pbsuser@pbsserver ~]$ qstat
[pbsuser@pbsserver ~]$ tracejob 11272
Job: 11272.pbsserver
07/01/2019 21:57:03 L Considering job to run
07/01/2019 21:57:03 L Insufficient amount of resource: ncpus (R: 100 A: 4 T: 4)
07/01/2019 21:57:03 S Job Queued at request of pbsuser@pbsserver.pbspro.org, owner = pbsuser@pbsserver.pbspro.org, job name = STDIN, queue = workq
07/01/2019 21:57:03 S Job Modified at request of Scheduler@pbsserver.pbspro.org
07/01/2019 21:57:03 L Job will never run with the resources currently configured in the complex
07/01/2019 21:57:03 S enqueuing into workq, state 1 hop 1
07/01/2019 21:57:18 S delete job request received
07/01/2019 21:57:18 S Job to be deleted at request of pbsuser@pbsserver.pbspro.org
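If you only want the requester and not the whole trace, the output above can be filtered. This is a minimal sketch, not an official PBS tool: deleted_by is a hypothetical helper name, and it assumes the message wording is exactly "Job to be deleted at request of" as in the tracejob output shown in this thread.

```shell
#!/bin/sh
# deleted_by (hypothetical helper): read tracejob-style lines on stdin and
# print the user@host that follows "Job to be deleted at request of".
# Assumption: the message text matches the tracejob output in this thread.
deleted_by() {
    sed -n 's/.*Job to be deleted at request of \([^ ]*\).*/\1/p'
}

# Example usage on a PBS host (job ID taken from this thread):
#   tracejob 11272 | deleted_by
```

For the job above this should print pbsuser@pbsserver.pbspro.org. If the job was deleted on an earlier day, tracejob may need to be told to search more than the current day's logs (its -n option, as far as I know), so check the tracejob usage on your system.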
Thanks for the details. They are helpful.