This is an old revision of the document!
Sharing data
We don't allow users to open up their private folders using file system permissions or access control lists. This because managing these correctly can be complicated and therefore easily lead to security problems, where users accidentally share data with all other cluster users.
If you need to share data on Hábrók with other users, there are two options.
Group directory
A group directory is useful if you need to share data with a group of users, and the other users on the cluster must not have access to that data. In this case we can set up a group on the cluster for these users and give the group access to one or more shared folders.
These group directories are created on /scratch
for data that needs to be processed and on /projects
for data that needs to be stored safely.
For working with this data there are two models:
- The files in the shared folder are readable and writable for all group members. This with the caveat that users and certain tools can override the default permission settings, making data unreadable or unwritable for others.
- There is a data manager that manages the data in the shared folder, and this data manager is the only person with full write access. All other group members can only read the data.
If you want to request a group directory, please contact hpc@rug.nl
and let us know the following things:
- The proposed name of the group (this name should not be in use and be convenient on the command line). The group name will always be prefixed by
hb-
. - The amount of space needed if more than the default 250 GB are required. Note that for
/projects
there is a fair use principle where you have to pay for storage above a certain threshold. For/scratch
a fair share policy is in place. - Who the primary owner of the group is. This person has to approve the requests for joining the group.
- Who can act as an alternate contact person for the group to approve these requests.
- Do all users need full write access or is there a data manager?
- If there is a data manager, who will fulfill that role?
Public directory
Sometimes you need to share non-sensitive, public data with someone else. For this we have set up a directory /scratch/public
. The data in this directory can be read by all users on the cluster.
Since we have allocated limited space to this directory a cleanup script will remove data after 30 days. Please let us know if you need to share data for a longer period. We can then create a group directory or move the data to a more permanent public location.