University of Cape Town
UCT High Performance Computing SLURM Cluster

Blog       UCT-HPC       Citations       Contact us       Help
Fri Feb 23 04:50:01 SAST 2018

CLUSTER LOAD Hold mouse over bars and indicators for more info. Lamp status
Partitions   
ucthi   uctlo
ucthimem
uctlomem

406

6

407

6

409
M
6

410
M
6

411
M
6

412

6

413

6

414

3

415

0

416

0

417

0

418

0
Disk space:
 / = 11% of 96G
/home = 80% of 504G
/scratch = 89% of 24T
Users logged in:

Head Node load: 0.06     Head Node RAM free: 92%
Currently computing: 1459 hours     Jobs running: 15     Jobs queued: 0
Efficiency: 33%    System overview    Queue accounting    Graphs

JOBS RUNNING
#  JOBID PARTITION              NAME     USER  ACCOUNT      STATE       TIME  CPUS  NODES     NODELIST(REASON)      QOS PRIORITY     CPU TIME
-----------------------------------------------------------------------------------------------------------------------------------------------
1   2563  ucthimem            3605J01 arossgil    maths    RUNNING 1-08:43:21     3      1         srvcnthpc406   normal    10052     4-02:10:03
2   2564  ucthimem            3605J02 arossgil    maths    RUNNING 1-08:43:17     3      1         srvcnthpc406   normal    10052     4-02:09:51
3   2565  ucthimem            3605J03 arossgil    maths    RUNNING 1-08:43:14     3      1         srvcnthpc407   normal    10052     4-02:09:42
4   2566  ucthimem            3605J04 arossgil    maths    RUNNING 1-08:43:11     3      1         srvcnthpc407   normal    10052     4-02:09:33
5   2567  ucthimem            3605J05 arossgil    maths    RUNNING 1-08:43:08     3      1         srvcnthpc409   normal    10052     4-02:09:24
6   2574  ucthimem            3608J01 arossgil    maths    RUNNING 1-08:24:02     3      1         srvcnthpc409   normal    10052     4-01:12:06
7   2575  ucthimem            3608J02 arossgil    maths    RUNNING 1-08:19:48     3      1         srvcnthpc410   normal    10052     4-00:59:24
8   2576  ucthimem            3608J03 arossgil    maths    RUNNING 1-08:19:46     3      1         srvcnthpc410   normal    10052     4-00:59:18
9   2577  ucthimem            3608J04 arossgil    maths    RUNNING 1-08:19:42     3      1         srvcnthpc411   normal    10052     4-00:59:09
10  2578  ucthimem            3608J05 arossgil    maths    RUNNING 1-08:19:39     3      1         srvcnthpc411   normal    10052     4-00:59:00
11  2579  ucthimem            3606J01 arossgil    maths    RUNNING 1-08:15:27     3      1         srvcnthpc412   normal    10052     4-00:46:24
12  2580  ucthimem            3606J02 arossgil    maths    RUNNING 1-08:10:55     3      1         srvcnthpc412   normal    10052     4-00:32:48
13  2581  ucthimem            3606J03 arossgil    maths    RUNNING 1-08:10:47     3      1         srvcnthpc413   normal    10052     4-00:32:24
14  2582  ucthimem            3606J04 arossgil    maths    RUNNING 1-08:10:44     3      1         srvcnthpc413   normal    10052     4-00:32:15
15  2583  ucthimem            3606J05 arossgil    maths    RUNNING 1-08:10:41     3      1         srvcnthpc414   normal    10052     4-00:32:06

CLUSTER STATUS
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
 ucthimem*    up 208-08:00:      1  down* srvcnthpc408
 ucthimem*    up 208-08:00:      8    mix srvcnthpc[406-407,409-414]
 ucthimem*    up 208-08:00:      4   idle srvcnthpc[415-418]
 uctlomem     up 208-08:00:      1  down* srvcnthpc408
 uctlomem     up 208-08:00:      8    mix srvcnthpc[406-407,409-414]
 uctlomem     up 208-08:00:      4   idle srvcnthpc[415-418]

PartitionName=ucthimem
    AllowGroups=ALL AllowAccounts=ALL AllowQos=ALL
    AllocNodes=ALL Default=YES QoS=N/A
    DefaultTime=NONE DisableRootJobs=NO ExclusiveUser=NO GraceTime=0 Hidden=NO
    MaxNodes=UNLIMITED MaxTime=208-08:00:00 MinNodes=1 LLN=NO MaxCPUsPerNode=UNLIMITED
    Nodes=srvcnthpc[406-418]
    PriorityJobFactor=20 PriorityTier=20 RootOnly=NO ReqResv=NO OverSubscribe=FORCE:4
    OverTimeLimit=NONE PreemptMode=REQUEUE
    State=UP TotalCPUs=104 TotalNodes=13 SelectTypeParameters=NONE
    DefMemPerCPU=2000 MaxMemPerCPU=4000
 
 PartitionName=uctlomem
    AllowGroups=ALL AllowAccounts=ALL AllowQos=ALL
    AllocNodes=ALL Default=NO QoS=N/A
    DefaultTime=NONE DisableRootJobs=NO ExclusiveUser=NO GraceTime=0 Hidden=NO
    MaxNodes=UNLIMITED MaxTime=208-08:00:00 MinNodes=1 LLN=NO MaxCPUsPerNode=UNLIMITED
    Nodes=srvcnthpc[406-418]
    PriorityJobFactor=10 PriorityTier=10 RootOnly=NO ReqResv=NO OverSubscribe=FORCE:4
    OverTimeLimit=NONE PreemptMode=REQUEUE
    State=UP TotalCPUs=104 TotalNodes=13 SelectTypeParameters=NONE
    DefMemPerCPU=2000 MaxMemPerCPU=4000
 

WORKER NODE STATUS
NodeName=srvcnthpc406 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=6 CPUErr=0 CPUTot=8 CPULoad=2.01
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G6:8
   NodeAddr=srvcnthpc406 NodeHostName=srvcnthpc406 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=24000 FreeMem=21786 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2018-01-23T14:28:04 SlurmdStartTime=2018-01-23T14:28:40
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=cpu=6,mem=24000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2564 ucthimem 3605J02 arossgil R 1-08:43:18 srvcnthpc406
      2563 ucthimem 3605J01 arossgil R 1-08:43:22 srvcnthpc406

NodeName=srvcnthpc407 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=6 CPUErr=0 CPUTot=8 CPULoad=2.01
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G6:8
   NodeAddr=srvcnthpc407 NodeHostName=srvcnthpc407 Version=17.02
   OS=Linux RealMemory=48000 AllocMem=24000 FreeMem=23239 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2018-01-23T15:10:12 SlurmdStartTime=2018-01-23T15:10:35
   CfgTRES=cpu=8,mem=48000M
   AllocTRES=cpu=6,mem=24000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2566 ucthimem 3605J04 arossgil R 1-08:43:12 srvcnthpc407
      2565 ucthimem 3605J03 arossgil R 1-08:43:15 srvcnthpc407

NodeName=srvcnthpc409 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=6 CPUErr=0 CPUTot=8 CPULoad=2.00
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G1:8
   NodeAddr=srvcnthpc409 NodeHostName=srvcnthpc409 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=24000 FreeMem=4403 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T12:00:40 SlurmdStartTime=2017-10-06T12:01:10
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=cpu=6,mem=24000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2574 ucthimem 3608J01 arossgil R 1-08:24:04 srvcnthpc409
      2567 ucthimem 3605J05 arossgil R 1-08:43:10 srvcnthpc409

NodeName=srvcnthpc410 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=6 CPUErr=0 CPUTot=8 CPULoad=2.01
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G1:8
   NodeAddr=srvcnthpc410 NodeHostName=srvcnthpc410 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=24000 FreeMem=4386 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T12:04:21 SlurmdStartTime=2017-10-06T12:04:59
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=cpu=6,mem=24000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2576 ucthimem 3608J03 arossgil R 1-08:19:48 srvcnthpc410
      2575 ucthimem 3608J02 arossgil R 1-08:19:50 srvcnthpc410

NodeName=srvcnthpc411 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=6 CPUErr=0 CPUTot=8 CPULoad=2.00
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G1:8
   NodeAddr=srvcnthpc411 NodeHostName=srvcnthpc411 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=24000 FreeMem=4802 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T12:02:16 SlurmdStartTime=2017-10-06T12:02:45
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=cpu=6,mem=24000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2578 ucthimem 3608J05 arossgil R 1-08:19:41 srvcnthpc411
      2577 ucthimem 3608J04 arossgil R 1-08:19:44 srvcnthpc411

NodeName=srvcnthpc412 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=6 CPUErr=0 CPUTot=8 CPULoad=2.02
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G1:8
   NodeAddr=srvcnthpc412 NodeHostName=srvcnthpc412 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=24000 FreeMem=5038 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T12:02:12 SlurmdStartTime=2017-10-06T12:02:38
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=cpu=6,mem=24000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2580 ucthimem 3606J02 arossgil R 1-08:10:57 srvcnthpc412
      2579 ucthimem 3606J01 arossgil R 1-08:15:29 srvcnthpc412

NodeName=srvcnthpc413 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=6 CPUErr=0 CPUTot=8 CPULoad=1.99
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G1:8
   NodeAddr=srvcnthpc413 NodeHostName=srvcnthpc413 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=24000 FreeMem=5051 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T11:59:36 SlurmdStartTime=2017-10-06T12:00:14
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=cpu=6,mem=24000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2582 ucthimem 3606J04 arossgil R 1-08:10:46 srvcnthpc413
      2581 ucthimem 3606J03 arossgil R 1-08:10:49 srvcnthpc413

NodeName=srvcnthpc414 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=3 CPUErr=0 CPUTot=8 CPULoad=1.01
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G1:8
   NodeAddr=srvcnthpc414 NodeHostName=srvcnthpc414 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=12000 FreeMem=16878 Sockets=2 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T12:00:49 SlurmdStartTime=2017-10-06T14:08:23
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=cpu=3,mem=12000M
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s
      2583 ucthimem 3606J05 arossgil R 1-08:10:43 srvcnthpc414

NodeName=srvcnthpc415 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=0 CPUErr=0 CPUTot=8 CPULoad=0.01
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G6:8
   NodeAddr=srvcnthpc415 NodeHostName=srvcnthpc415 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=0 FreeMem=28609 Sockets=2 Boards=1
   State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T11:58:34 SlurmdStartTime=2017-10-06T11:58:56
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

NodeName=srvcnthpc416 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=0 CPUErr=0 CPUTot=8 CPULoad=0.01
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G1:8
   NodeAddr=srvcnthpc416 NodeHostName=srvcnthpc416 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=0 FreeMem=28886 Sockets=2 Boards=1
   State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T12:00:36 SlurmdStartTime=2017-10-06T12:01:04
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

NodeName=srvcnthpc417 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=0 CPUErr=0 CPUTot=8 CPULoad=0.02
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G6:8,gpu:kepler:2
   NodeAddr=srvcnthpc417 NodeHostName=srvcnthpc417 Version=17.02
   OS=Linux RealMemory=48000 AllocMem=0 FreeMem=46955 Sockets=2 Boards=1
   State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2018-02-09T12:45:20 SlurmdStartTime=2018-02-09T13:33:12
   CfgTRES=cpu=8,mem=48000M
   AllocTRES=
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

NodeName=srvcnthpc418 Arch=x86_64 CoresPerSocket=4
   CPUAlloc=0 CPUErr=0 CPUTot=8 CPULoad=0.01
   AvailableFeatures=(null)
   ActiveFeatures=(null)
   Gres=chip:G6:8
   NodeAddr=srvcnthpc418 NodeHostName=srvcnthpc418 Version=17.02
   OS=Linux RealMemory=32000 AllocMem=0 FreeMem=26369 Sockets=2 Boards=1
   State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
   Partitions=ucthimem,uctlomem 
   BootTime=2017-10-06T11:57:44 SlurmdStartTime=2017-10-06T11:58:07
   CfgTRES=cpu=8,mem=32000M
   AllocTRES=
   CapWatts=n/a
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s