Submitting to specific nodes:
sbatch --exclude node[001-008] submit.GPU.thor.sh
sbatch --nodelist node010 submit.GPU.thor.sh
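If you always want a job steered this way, the same options can live inside the submit script as #SBATCH directives instead of on the command line. A rough sketch (the node names are just the ones from the examples above; the rest of submit.GPU.thor.sh is whatever you already have):

#!/bin/bash
#SBATCH --exclude=node[001-008]    # never land on these nodes
## or, to pin the job to one specific node instead:
##SBATCH --nodelist=node010
# ... the rest of your usual submit script ...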
Holding jobs in the queue:
This lets you keep jobs in the queue that won't run even if resources are available, usually so that other people's jobs can go ahead of yours.
scontrol hold $jobID (get the job ID from squeue; it's the leftmost column), and then, when you want it to run:
scontrol release $jobID
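For example (the job ID 12345 is made up; use whatever squeue shows for your job):

squeue -u $USER          # list your jobs; the job ID is the leftmost column
scontrol hold 12345      # job stays pending even when nodes free up
scontrol release 12345   # let the scheduler run it again

You can also submit a job already held with sbatch --hold submit.GPU.thor.sh and release it later the same way.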
Checking status of nodes:
sinfo ("mix" means the node is running jobs, "idle" means it is free and waiting for work; both "down" and "drain" are bad):
[oliver@thoreau 0.4.0_glycoproteinLys.pdb]$ sinfo
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
defq*        up   infinite      5  down* node[003-007]
defq*        up   infinite      1  drain node001
defq*        up   infinite      3    mix node[002,009-010]
mdaas        up   infinite      2   idle node[008,011]
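If you need more detail than the summary, sinfo can print one line per node, and scontrol will show everything Slurm knows about a node, including why it is drained or down (node001 here is just an example):

sinfo -N -l                  # one line per node, with state and reason
scontrol show node node001   # full detail for a single node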
Checking GPU status:
ssh node001 nvidia-smi

Example output (this one is from node009):

[oliver@node009 ~]$ nvidia-smi
Mon May 13 01:06:36 2024
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 495.29.05    Driver Version: 495.29.05    CUDA Version: 11.5     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Quadro RTX 5000     On   | 00000000:3B:00.0 Off |                  Off |
| 38%   64C    P2   187W / 230W |    754MiB / 16125MiB |     97%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   1  Quadro RTX 5000     On   | 00000000:5E:00.0 Off |                  Off |
| 33%   38C    P2    64W / 230W |    466MiB / 16125MiB |     24%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   2  Quadro RTX 5000     On   | 00000000:AF:00.0 Off |                  Off |
| 33%   23C    P8     7W / 230W |      0MiB / 16125MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   3  Quadro RTX 5000     On   | 00000000:D8:00.0 Off |                  Off |
| 33%   23C    P8     7W / 230W |      0MiB / 16125MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A     289931      C   ...ps/amber20/bin/pmemd.cuda     751MiB |
|    1   N/A  N/A     293344      C   ...ps/amber20/bin/pmemd.cuda     463MiB |
+-----------------------------------------------------------------------------+

GPU-Util tells you how hard each GPU is working. The processes should all be on separate GPUs, i.e. GPUs 0 and 1, not 0 and 0.
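To spot-check several nodes at once, you can loop over them and ask nvidia-smi for just the numbers. A sketch (the node names are the "mix" nodes from the sinfo example above; swap in whichever you care about):

for n in node002 node009 node010; do
    echo "== $n =="
    ssh "$n" nvidia-smi --query-gpu=index,utilization.gpu,memory.used --format=csv,noheader
done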