Differences
This shows you the differences between two versions of the page.
| habrok:connecting_to_the_system:login_nodes [2023/09/11 12:52] – [Interactive GPU node] Changed the hostnames aurel | habrok:connecting_to_the_system:login_nodes [2025/10/23 15:50] (current) – [Long process termination] pedro | ||
|---|---|---|---|
| Line 3: | Line 3: | ||
| ====== Login nodes ====== | ====== Login nodes ====== | ||
| - | Hábrók has five login nodes that can be used to connect to the system. Besides redundancy reasons (you can always try another one if one of them is down), they all serve different purposes. | + | Hábrók has six login nodes that can be used to connect to the system. Besides redundancy reasons (you can always try another one if one of them is down), they all serve different purposes. |
| ===== Login nodes ===== | ===== Login nodes ===== | ||
| - | '' | + | '' |
| We have set up two of these login nodes to increase the availability of the service. | We have set up two of these login nodes to increase the availability of the service. | ||
| Line 17: | Line 17: | ||
| The interactive nodes are about half the size of a default compute node, and they allow for a bit more testing. If you just want to run your program for a couple of minutes, these are the machines to use. Do keep in mind that these are also shared machines and other people may also want to do some testing. So, if you need to do longer and/or more intensive tests, these tasks should be submitted as jobs. | The interactive nodes are about half the size of a default compute node, and they allow for a bit more testing. If you just want to run your program for a couple of minutes, these are the machines to use. Do keep in mind that these are also shared machines and other people may also want to do some testing. So, if you need to do longer and/or more intensive tests, these tasks should be submitted as jobs. | ||
| - | To prevent a single user from using all capacity, CPU and memory limits are in place. | + | To prevent a single user from using all capacity, CPU and memory limits are in place. Furthermore, long-running intensive processes are killed automatically; see the "Long process termination" section below. | ||
| - | ===== Interactive GPU node ===== | + | ===== Interactive GPU nodes ===== |
| - | Finally, the interactive GPU nodes, '' | + | Finally, the interactive GPU nodes, '' |
| - | These machines have an NVIDIA | + | These machines have an NVIDIA |
| - | Please keep in mind that this is also a shared machine, and other users also want to use the GPU in this machine. So, allow everyone to make use of these GPUs and do not perform long runs here. Long runs should be submitted as jobs to the scheduler. | + | Please keep in mind that this is also a shared machine, and other users also want to use the GPU in this machine. So, allow everyone to make use of these GPUs and do not perform long runs here. Long runs should be submitted as jobs to the scheduler. Long-running processes are killed automatically; see the "Long process termination" section below. | ||
| + | |||
| + | ===== Long process termination ===== | ||
| + | |||
| + | Since 2025-10-24, we automatically kill misbehaving processes that have been running for too long and using too many resources on the login, interactive, | ||
| + | |||
| + | ===== Periodic reboots ===== | ||
| + | |||
| + | In order to prevent the login/ | ||
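
Given the long process termination described above, it can be useful to check which of your own processes on a login or interactive node have been running for a long time at high CPU usage. The following is a minimal sketch, not the mechanism the system itself uses. It assumes a Linux node with the standard `ps` command; the function names and the thresholds `min_seconds` and `min_cpu` are hypothetical placeholders, since the actual limits are not stated in the text shown here.

```python
import getpass
import subprocess


def parse_etime(etime: str) -> int:
    """Convert the ps elapsed-time format [[dd-]hh:]mm:ss to seconds."""
    days = 0
    if "-" in etime:
        day_part, etime = etime.split("-", 1)
        days = int(day_part)
    parts = [int(p) for p in etime.split(":")]
    # Pad missing leading fields (hours) with zeros.
    while len(parts) < 3:
        parts.insert(0, 0)
    hours, minutes, seconds = parts
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


def long_running(ps_output: str, min_seconds: int, min_cpu: float):
    """Return (pid, elapsed_seconds, cpu_percent, command) for each process
    that exceeds both thresholds. The thresholds are illustrative only."""
    rows = []
    for line in ps_output.strip().splitlines():
        fields = line.split(None, 3)
        if len(fields) < 4:
            continue  # skip malformed lines
        pid, etime, pcpu, comm = fields
        elapsed = parse_etime(etime)
        if elapsed >= min_seconds and float(pcpu) >= min_cpu:
            rows.append((int(pid), elapsed, float(pcpu), comm))
    return rows


def my_processes() -> str:
    """List the current user's processes without a header line."""
    result = subprocess.run(
        ["ps", "-u", getpass.getuser(), "-o", "pid=,etime=,pcpu=,comm="],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

Calling `long_running(my_processes(), min_seconds=3600, min_cpu=50.0)` would list your processes older than one hour that use at least 50% of a core, again with placeholder values. Anything that shows up there is probably better submitted as a job.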