habrok:advanced_job_management:running_jobs_on_gpus — last modified 2026/02/27 10:20 by fokke
==== Available GPU types ====
  
^ Node         ^ GPU type            ^  GPUs per node ^  Memory per GPU ^  CPUs per node ^  Memory per node ^ Slurm name ^
| A100         | Nvidia A100         |  4 |  40 GB |  64 |  512 GB | a100 |
| V100         | Nvidia V100         |  |  32 GB |  36 |  128 GB | v100 |
| RTX Pro 6000 | Nvidia RTX Pro 6000 |  |  96 GB |  128 |  192 GB | rtx_pro_6000 |
| L40S         | Nvidia L40S         |  |  48 GB |  56 |  512 GB | l40s |
  
==== Example ====
<code>
#SBATCH --gpus-per-node=a100:2
</code>
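As a fuller sketch, the directive above could sit in a complete batch script like the following. The job name, time limit, and memory request are illustrative assumptions, not values taken from this page:

<code>
#!/bin/bash
#SBATCH --job-name=gpu_example   # illustrative job name
#SBATCH --time=01:00:00          # illustrative time limit
#SBATCH --mem=16G                # illustrative memory request
#SBATCH --gpus-per-node=a100:2   # two A100 GPUs, as in the example above

# List the GPUs that Slurm allocated to this job
nvidia-smi
</code>

Such a script would be submitted with ''sbatch jobscript.sh''.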
  
<code>
#SBATCH --gpus-per-node=1
</code>

Note that this will only send jobs to the V100 and A100 nodes. This is because not all software is compatible with the RTX Pro 6000 GPUs. Furthermore, the more capable RTX Pro 6000 nodes should not be swamped with jobs that do not need their capabilities. See [[rtx_pro_6000_gpu_nodes]] for more details.
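If your software does run on the newer cards and genuinely needs their capabilities, you can request them explicitly using the Slurm name from the table above, for example:

<code>
#SBATCH --gpus-per-node=rtx_pro_6000:1
</code>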
  
==== Interactive GPU node ====
  
<code>
gpu1.hb.hpc.rug.nl
gpu2.hb.hpc.rug.nl
</code>
  
These machines each have an NVIDIA L40S GPU, which can be shared by multiple users. The tool ''nvidia-smi'' will show if the GPU is in use.
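For example, to log in to one of these machines and check the GPU before starting work (the username is a placeholder; use your own account):

<code>
ssh <username>@gpu1.hb.hpc.rug.nl   # replace <username> with your own account
nvidia-smi                          # shows current GPU utilisation and processes
</code>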
  
** Please keep in mind that these are shared machines, so allow everyone to make use of these GPUs and do not perform long runs here. Long runs should be submitted as jobs to the scheduler. **
</code>
  
There is currently an issue with using ''srun %%--%%gpus-per-node'', but there is a workaround by using ''%%--%%gres'' instead:
<code>
srun --gres=gpu:1 --time=01:00:00 --pty /bin/bash
</code>
  
or:
<code>
srun --gres=gpu:v100:1 --time=01:00:00 --pty /bin/bash
</code>
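Once the interactive shell starts, a quick way to confirm that a GPU was actually allocated to the session (assuming the node's NVIDIA drivers are available, which this page does not state explicitly):

<code>
echo $CUDA_VISIBLE_DEVICES   # Slurm sets this to the index(es) of the allocated GPU(s)
nvidia-smi                   # should list the GPU requested above
</code>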