PP-302: Implement save of PBS data for post-run analysis

hirenvadalia · January 10, 2017, 4:37am

@anamika I have updated the design document to address your comments. Please have a look again.

kjakkali · January 10, 2017, 8:12am

Change looks good to me. I sign off.

arungrover · January 10, 2017, 11:22pm

Document looks good to me.

anamika · January 11, 2017, 7:08pm

Sorry for not adding a comment but I looked and it looks good.

hirenvadalia · February 16, 2017, 2:36pm

@developers,

FYI: Based on a review comment on the code, I have updated the design document to include details of the owner of the directory in which all post-analysis data will be stored, and what the permissions of this directory will be.

NOTE: This is just an addition of information to the design document; there is no change in the design of saving post-analysis data.

anamika · March 5, 2017, 9:55pm

Still looks good to me. I spotted the typo below:
“use who invoked pbs_benchpress command” should be “user who has invoked pbs_benchpress command”

hirenvadalia · March 6, 2017, 5:12am

@anamika Thank you. I have fixed the typo.

dilip-krishnan · March 6, 2017, 7:45am

Hi @hirenvadalia,
I have one very silly question: is home directory information collected for each node involved in the test case?
For example, if the test case requires a 3-node setup to run, will it collect data from each of these nodes or just from the server node?

Regards
Dilip

hirenvadalia · March 6, 2017, 7:56am

@dilip-krishnan PTL runs the pbs_diag command to collect data, and as of now it can be run only on the server node. So no, it will not collect data from other nodes.
If you feel it is needed, please create a ticket against pbs_diag to collect data from other nodes as well.

dilip-krishnan · March 6, 2017, 8:13am

Extending pbs_diag to collect node information for PTL wouldn’t be meaningful. This should be part of PTL, since mom nodes contain config information, logs, and core files if generated.

Why is the option name --post-analysis-data? PTL does no analysis on the data, and the name sounds more like a flag than an option that requires a directory to store the data. I feel it should be --post-failure-data-dir.

Another question: does PTL collect this data even when a test case fails due to a bug in PTL, and does it include any data from the PTL side for such a failure?

hirenvadalia · March 6, 2017, 8:43am

@dilip-krishnan Why can’t we extend pbs_diag? Why should it be part of PTL?

And regarding the use of pbs_diag, the option names, and so on: all of that is in the design document, which has already been reviewed. So if you have any points or suggestions, please create a ticket for them.

dilip-krishnan · March 6, 2017, 9:10am

Isn’t it PTL that is generating the post-failure data? Why should pbs_diag be extended to achieve something that is not a customer (support people) specific requirement? If we extend pbs_diag to collect data on all nodes in the complex, it could be very time consuming and meaningless for sites containing thousands of nodes.

I don’t agree that creating a ticket against pbs_diag would be correct. If any ticket is required for this, it should be for PTL.

hirenvadalia · March 16, 2017, 7:05am

@dilip-krishnan As we discussed offline, pbs_diag already collects job information from compute nodes if the -j option is given to it, so we can extend pbs_diag to collect other information as well.
Also, we can make collecting information from compute nodes optional, so by default it won’t contact compute nodes and won’t be time consuming.
Hope this answers all your queries!

dilip-krishnan · March 16, 2017, 6:04pm

Hi @hirenvadalia,
Yeah, I understand now why extending pbs_diag would be the right choice, since it pretty much does a similar kind of task.

Regards
Dilip
