The sinfo command will tell you some useful information about the available partitions on the cluster, including a partition’s time limit, how many nodes are available on that partition, which nodes are available on that partition, and the state of those nodes.
[username@a002 ~]$ sinfo
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
interactive up 12:00:00 1 down* g001
interactive up 12:00:00 44 mix c[001-002,005,007,011,014-015,017-018,021,023-024,035,040-043,046-047,055,057-058,060-061,063,066-067,069-072,081-085,088-089],g[002-005,007-008]
interactive up 12:00:00 55 alloc c[003-004,006,008-010,012-013,016,019-020,022,025-034,036-039,044-045,048-054,056,059,062,064-065,068,073-080,086-087,090-092],g006
interactive up 12:00:00 1 idle g009
short* up 12:00:00 8 drng@ cb[001-002,004-005,009-011],gb001
short* up 12:00:00 10 down* c[106,148-153,157-158],g001
short* up 12:00:00 1 resv cb007
short* up 12:00:00 84 mix c[001-002,005,007,011,014-015,017-018,021,023-024,035,040-043,046-047,055,057-058,060-061,063,066-067,069-072,081-085,088-089,105,111,113,116,124-129,138-140,167-168,191,201-202,215-216,220-226],cb[003,006,008,012-020],g[002-005,007-008,010]
short* up 12:00:00 153 alloc c[003-004,006,008-010,012-013,016,019-020,022,025-034,036-039,044-045,048-054,056,059,062,064-065,068,073-080,086-087,090-104,107-110,112,114-115,117-123,130-137,141-147,154-156,159-166,169-190,192-200,203-214,217-219],g006
short* up 12:00:00 10 idle g[009,011-019]
medium up 2-00:00:00 8 drng@ cb[001-002,004-005,009-011],gb001
medium up 2-00:00:00 10 down* c[106,148-153,157-158],g001
medium up 2-00:00:00 1 resv cb007
medium up 2-00:00:00 79 mix c[001-002,005,007,011,014-015,017-018,021,023-024,035,040-043,046-047,055,057-058,060-061,063,066-067,069-072,081-085,088-089,105,111,113,116,124-129,138-140,167-168,191,201-202,215-216,220-221],cb[003,006,008,012-020],g[002-005,007-008,010]
medium up 2-00:00:00 153 alloc c[003-004,006,008-010,012-013,016,019-020,022,025-034,036-039,044-045,048-054,056,059,062,064-065,068,073-080,086-087,090-104,107-110,112,114-115,117-123,130-137,141-147,154-156,159-166,169-190,192-200,203-214,217-219],g006
medium up 2-00:00:00 10 idle g[009,011-019]
long up 5-00:00:00 8 drng@ cb[001-002,004-005,009-011],gb001
long up 5-00:00:00 10 down* c[106,148-153,157-158],g001
long up 5-00:00:00 1 resv cb007
long up 5-00:00:00 79 mix c[001-002,005,007,011,014-015,017-018,021,023-024,035,040-043,046-047,055,057-058,060-061,063,066-067,069-072,081-085,088-089,105,111,113,116,124-129,138-140,167-168,191,201-202,215-216,220-221],cb[003,006,008,012-020],g[002-005,007-008,010]
long up 5-00:00:00 153 alloc c[003-004,006,008-010,012-013,016,019-020,022,025-034,036-039,044-045,048-054,056,059,062,064-065,068,073-080,086-087,090-104,107-110,112,114-115,117-123,130-137,141-147,154-156,159-166,169-190,192-200,203-214,217-219],g006
long up 5-00:00:00 10 idle g[009,011-019]
PARTITION | Name of a partition: short, medium, long, interactive |
---|---|
AVAIL | Partition state: up or down |
TIMELIMIT | Maximum time limit for any user job in days-hours:minutes, default 5 days |
NODES | Count of nodes with this particular configuration |
STATE | State of the nodes. Possible states include: allocated, completing, down, drained, draining, fail, failing, future, idle |
NODELIST | Names of nodes associated with the configuration/partition |
To display more specific info, see man sinfo
For example:
[username@a002 ~]$ sinfo -o '%11P %5D %22N %4c %21G %7m %11l'
PARTITION NODES NODELIST CPUS GRES MEMORY TIMELIMIT
interactive 2 g[001-002] 48 gpu:v100:4 191505 12:00:00
interactive 92 c[001-092] 48 (null) 191507 12:00:00
interactive 7 g[003-009] 64 gpu:a100:4 256735 12:00:00
short* 1 gb001 64 gpu:h200:8(S:0-1) 2063257 12:00:00
short* 2 g[001-002] 48 gpu:v100:4 191505 12:00:00
short* 246 c[001-226],cb[001-020] 48+ (null) 191507+ 12:00:00
short* 17 g[003-019] 64 gpu:a100:4 256735 12:00:00
medium 1 gb001 64 gpu:h200:8(S:0-1) 2063257 2-00:00:00
medium 2 g[001-002] 48 gpu:v100:4 191505 2-00:00:00
medium 241 c[001-221],cb[001-020] 48+ (null) 191507+ 2-00:00:00
medium 17 g[003-019] 64 gpu:a100:4 256735 2-00:00:00
long 1 gb001 64 gpu:h200:8(S:0-1) 2063257 5-00:00:00
long 2 g[001-002] 48 gpu:v100:4 191505 5-00:00:00
long 241 c[001-221],cb[001-020] 48+ (null) 191507+ 5-00:00:00
long 17 g[003-019] 64 gpu:a100:4 256735 5-00:00:00