Job resource usage can be determined on job completion by checking the following sacct columns:
- MaxRSS - Peak memory usage.
- TotalCPU - Total CPU time used. Check that Elapsed x Alloc ≈ TotalCPU, where Alloc is the number of allocated CPUs (the AllocCPUS column).
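The Elapsed x Alloc ≈ TotalCPU check can be done by hand, but a small helper makes it repeatable. The following is a minimal sketch, not an official tool: the function names `to_seconds` and `cpu_efficiency` are made up for illustration, and it assumes the time strings look like those sacct usually prints (`[DD-][HH:]MM:SS[.fff]`).

```shell
#!/bin/bash
# Values can be read from sacct first, for example:
#   sacct -j <jobid> --format=JobID,Elapsed,TotalCPU,AllocCPUS,MaxRSS

# Convert a Slurm-style time string ([DD-][HH:]MM:SS[.fff]) to whole seconds.
# (Hypothetical helper; the accepted formats are an assumption.)
to_seconds() {
    awk -v t="$1" 'BEGIN {
        days = 0
        # Split off an optional leading "days-" part.
        if (split(t, d, "-") == 2) { days = d[1]; t = d[2] }
        n = split(t, p, ":")
        if (n == 3)      secs = p[1] * 3600 + p[2] * 60 + p[3]
        else if (n == 2) secs = p[1] * 60 + p[2]
        else             secs = p[1]
        printf "%.0f\n", days * 86400 + secs
    }'
}

# Print CPU efficiency in percent: TotalCPU / (Elapsed x AllocCPUS).
# Arguments: <Elapsed> <TotalCPU> <AllocCPUS>
cpu_efficiency() {
    local elapsed total ncpus
    elapsed=$(to_seconds "$1")
    total=$(to_seconds "$2")
    ncpus=$3
    awk -v e="$elapsed" -v t="$total" -v n="$ncpus" \
        'BEGIN { if (e * n > 0) printf "%.1f\n", 100 * t / (e * n); else print "n/a" }'
}

# Example: 1 hour elapsed on 4 CPUs with 3 hours of CPU time used.
# cpu_efficiency 01:00:00 03:00:00 4    # prints 75.0
```

A value well below 100 suggests the job was allocated more CPUs than it used.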
However, if you want to record resource usage over the run-time of your job,
#SBATCH --profile task can be added to your SLURM header.
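As an illustration, a batch script with profiling enabled might look like the sketch below. Only the `--profile task` line comes from the text above; the job name, time limit, memory request and workload are placeholders to be replaced with your own values.

```shell
#!/bin/bash -e
#SBATCH --job-name profile_demo    # placeholder name
#SBATCH --time 00:10:00            # placeholder time limit
#SBATCH --mem 1G                   # placeholder memory request
#SBATCH --profile task             # record resource usage during the run

# Placeholder workload; replace with the commands your job actually runs.
echo "Job started on $(hostname)"
```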
On completion of your job, collate the data into an HDF5 file using the command
sh5util -j <jobid>
This will throw the error
No node-step files found for jobid
which can be ignored. A file named
job_<JobID>.h5 will be created.
Contact us for help analysing the data.
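Because the collation command prints an error that can be ignored, it may be easier to judge success by whether the output file exists. The wrapper below is a minimal sketch under that assumption; `collate_profile` is a hypothetical name, and it assumes the file is written to the current directory as job_<JobID>.h5.

```shell
#!/bin/bash
# Collate the profile data for one job and confirm the HDF5 file was written.
# Usage: collate_profile <jobid>
collate_profile() {
    local jobid=$1
    local outfile="job_${jobid}.h5"

    # The exit status is deliberately ignored, on the assumption that the
    # "No node-step files found" message is harmless, as noted in the text.
    sh5util -j "$jobid" || true

    # Judge success by the presence of the output file instead.
    if [ -f "$outfile" ]; then
        echo "Profile data written to $outfile"
    else
        echo "Expected $outfile was not created" >&2
        return 1
    fi
}
```

If the HDF5 command-line tools are installed, `h5ls -r job_<JobID>.h5` lists the groups and datasets in the resulting file.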