How to find job resource usage
List your jobs (replace USERNAME with your cluster user name):
$ sacct -X --starttime 2025-04-01T00:00:00 --endtime 2025-05-01T23:59:59 --user USERNAME
JobID JobName Partition Account AllocCPUS State ExitCode
------------ ---------- ---------- ---------- ---------- ---------- --------
. . .
21541421 o2_assim CPUQ share-ie-+ 20 COMPLETED 0:0
21541422 o2_assim CPUQ share-ie-+ 20 COMPLETED 0:0
21541423 o2_assim CPUQ share-ie-+ 20 COMPLETED 0:0
Slurm job efficiency report (seff).
$ seff 21541423
Job ID: 21541423
Cluster: idun
User/Group: USERNAME/group_s
State: COMPLETED (exit code 0)
Nodes: 1
Cores per node: 20
CPU Utilized: 05:57:51
CPU Efficiency: 78.76% of 07:34:20 core-walltime
Job Wall-clock time: 00:22:43
Memory Utilized: 5.29 GB
Memory Efficiency: 90.23% of 5.86 GB (5.86 GB/node)
List your jobs with extended formatting (replace USERNAME with your cluster user name):
$ sacct -X --starttime 2025-04-01T00:00:00 --endtime 2025-05-01T23:59:59 --user USERNAME --format jobid,jobname%10,start,elapsed,state%22,node%20
JobID JobName Start Elapsed State NodeList
------------ ---------- ------------------- ---------- ---------------------- --------------------
. . .
21541421 o2_assim 2025-04-10T11:40:56 00:22:47 COMPLETED idun-03-17
21541422 o2_assim 2025-04-10T11:40:56 00:26:21 COMPLETED idun-04-02
21541423 o2_assim 2025-04-10T11:40:56 00:22:43 COMPLETED idun-04-28
Show detailed informating for one job:
$ sacct -j 21541423 --units=G --format jobid,jobname,start,elapsed,state,node,CPUTime,TotalCPU,MaxRSS,MaxDiskRead,MaxDiskWrite,ReqTRES%40,TRESUsageInMax%70
JobID JobName Start Elapsed State NodeList CPUTime TotalCPU MaxRSS MaxDiskRead MaxDiskWrite ReqTRES TRESUsageInMax
------------ ---------- ------------------- ---------- ---------- --------------- ---------- ---------- ---------- ------------ ------------ ---------------------------------------- ----------------------------------------------------------------------
21541423 o2_assim 2025-04-10T11:40:56 00:22:43 COMPLETED idun-04-28 07:34:20 05:57:50 billing=240,cpu=20,mem=5.86G,node=1
21541423.ba+ batch 2025-04-10T11:40:56 00:22:43 COMPLETED idun-04-28 07:34:20 05:57:50 5.29G 259.26G 0.42G cpu=05:57:50,energy=0,fs/disk=259.26G,mem=5.29G,pages=0.00G,vmem=0
21541423.ex+ extern 2025-04-10T11:40:56 00:22:43 COMPLETED idun-04-28 07:34:20 00:00:00 0 0.01M 0.00M cpu=00:00:00,energy=0,fs/disk=0.00G,mem=0,pages=0,vmem=
Explanation:
# CPUTime
Time used (Elapsed time * CPU count) by a job or step in HH:MM:SS format.
# TotalCPU
The sum of the SystemCPU and UserCPU time used by the job or job step.
# MaxRSS
Maximum resident set size of all tasks in job.
# ReqTres
Trackable resources. These are the minimum resource counts requested by the job/step at submission time.
# node%20 or ReqTRES%40
% and number means size of the column. Useful when text is too long.
More about sacct in the Slurm documentation: https://slurm.schedmd.com/sacct.html
Reporting
sreport is used to generate reports of job usage
# sreport -n cluster AccountUtilizationByUser Start=2025-03-01T00:00:00 End=2025-04-01T00:00:00 -T cpu,gres/gpu,gres/gpu:p100,gres/gpu:v100,gres/gpu:a100,gres/gpu:h100,billing format=Accounts%21,Login,TRESName,Used -t hours User=penl
share-group USERNAME cpu 11995
share-group USERNAME gres/gpu 923
share-group USERNAME gres/gpu:p100 50
share-group USERNAME gres/gpu:v100 34
share-group USERNAME gres/gpu:a100 5
share-group USERNAME gres/gpu:h100 923
share-group USERNAME billing 882063