dcc:pdpsol:publishinghsd [2026/03/03 14:03] – created marlondcc:pdpsol:publishinghsd [2026/03/03 15:11] (current) – [Publishing de-identified, anonymized or synthetic data] marlon
Line 1: Line 1:
-This page is under construction!
 +{{indexmenu_n>5}} 
 +====== Archiving and Publishing Human Subject Data ====== 
 + 
 +===== Introduction ===== 
 +Your project is nearing completion. It is time to prepare your data for archiving and publishing in accordance with [[https://www.rug.nl/digital-competence-centre/research-data/fair-data-open-science|the FAIR principles]], making your data //"as open as possible and as closed as necessary"//. When research involves human participants, there is a tension between protecting their privacy and meeting expectations to archive and publish data so that others can verify and reuse your work. Navigating this tension requires careful planning and thoughtful decisions: put safeguards in place that protect participants while still allowing responsible access for future research. Use the sections below to guide you through this process. 
 +  
 +===== What needs to be archived and what needs to be published? ===== 
 + 
 +Check whether you can further minimize your data, with two goals of archiving in mind: 
 +  * Select and organize the data and other materials that are needed to validate your findings; 
 +  * Select and organize the data and other materials that are potentially valuable for further research by you, your team, or fellow researchers. 
 + 
 +===== De-identifying data before archiving or publishing =====
 +Often it is not necessary to keep all collected data in order to validate your findings or to let other researchers reuse your data. 
 +  * Limit the (personal) data and materials you archive to those needed to verify your research. For example, anonymized consent forms can be archived, while consent forms containing personal data can be deleted in accordance with the UG protocol. Follow the procedures in the [[datadesctruction|destruction protocol(s)]] that you designed, and add these protocol(s) to your data package, publication package or archive. 
 +  * Determine whether it is possible to [[de-identification|de-identify]] your data before publishing, while keeping the usability of your dataset in mind. 
 + 
 +===== Publishing de-identified, anonymized or synthetic data ===== 
 +FAIR data does not necessarily mean that all your data and materials need to be openly available. Even after de-identification, there can be [[https://www.rug.nl/digital-competence-centre/research-data/archive-and-publish/make-your-data-available-under-restricted-access|good reasons to restrict access to your data]]. The objective is to have data as open as possible, and as closed and protected as necessary. 
 + 
 +Consider applying a ‘layered’ approach to your (de-identified) files by scoring your files in terms of sensitivity.  
 + 
 +==== Category 1: contains no personal data ====
 +Publish your dataset in a recognized data repository such as [[https://www.rug.nl/digital-competence-centre/research-data/archive-and-publish/dataversenl|DataverseNL]], on the condition that __**no**__ other [[https://www.rug.nl/digital-competence-centre/research-data/archive-and-publish/make-your-data-available-under-restricted-access|reasons for restricting access]] apply. Allow for reuse by adding a license (for instance, [[https://www.rug.nl/library/open-access/how-to-publish-open-access/creative-commons-licenses|a Creative Commons license]]) and use the persistent identifier (e.g., [[https://www.rug.nl/library/publish/isbn-doi|DOI]]) for data citation. 
 + 
 +==== Category 2: contains personal data in pseudonymized form (not anonymized) ====
 +Publish your dataset in a recognized repository such as [[https://www.rug.nl/digital-competence-centre/research-data/archive-and-publish/dataversenl|DataverseNL]], under restricted access. Determine the terms of access and use for external parties that would like to reuse your data. Make sure that these terms of access align with the informed consent. 
 + 
 +==== Category 3: contains sensitive personal data ====
 +When your data still contain highly sensitive information, do not publish them in a data repository, whether openly or under access controls. Instead, [[https://www.rug.nl/digital-competence-centre/research-data/archive-and-publish/|archive your data]] in accordance with the [[https://www.rug.nl/digital-competence-centre/research-data/policies|research data policy of your faculty or institute]]. The UG DCC can assist in developing a procedure for making these sensitive data available for reuse under well-defined conditions. Make sure that these conditions are in line with the informed consent. 
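
The layered approach above can be sketched as a small lookup that maps each file's sensitivity category to a publication route. This is only an illustrative sketch, not an official UG or DCC tool: the names ''Sensitivity'', ''route_for'' and ''plan'', the route labels, and the example file names are all hypothetical. It also assumes that a Category 1 file with other reasons for restricting access falls back to restricted access, which the guidance implies but does not state outright.

```python
from enum import Enum


class Sensitivity(Enum):
    """The three categories from the layered approach (names are hypothetical)."""
    NO_PERSONAL_DATA = 1   # Category 1
    PSEUDONYMIZED = 2      # Category 2
    SENSITIVE = 3          # Category 3


# Hypothetical route labels summarizing the guidance per category.
ROUTES = {
    Sensitivity.NO_PERSONAL_DATA: "open",        # repository, license, persistent identifier
    Sensitivity.PSEUDONYMIZED: "restricted",     # repository, restricted access, terms of use
    Sensitivity.SENSITIVE: "archive_only",       # archive under faculty/institute policy
}


def route_for(category, other_restrictions=False):
    """Return the route label for one file.

    Assumption: a Category 1 file with other reasons to restrict access
    is treated as restricted rather than open.
    """
    if category is Sensitivity.NO_PERSONAL_DATA and other_restrictions:
        return "restricted"
    return ROUTES[category]


def plan(files):
    """Map each file name to its route, given a {name: Sensitivity} dict."""
    return {name: route_for(category) for name, category in files.items()}


if __name__ == "__main__":
    # Example file names are made up for illustration.
    example = {
        "aggregated_results.csv": Sensitivity.NO_PERSONAL_DATA,
        "survey_pseudonymized.csv": Sensitivity.PSEUDONYMIZED,
        "interview_audio.zip": Sensitivity.SENSITIVE,
    }
    for name, route in plan(example).items():
        print(f"{name}: {route}")
```

Scoring files individually, rather than the dataset as a whole, lets the least sensitive files be published openly while the rest stay restricted or archived. Whatever route is chosen still has to match the informed consent.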
  
 [[dcc:pdpsol:start | → Go back to the Privacy & Data protection home page]]