Differences
This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
| dcc:pdpsol:publishinghsd [2026/04/10 10:04] – old revision restored (2026/03/30 08:37) alba | dcc:pdpsol:publishinghsd [2026/04/13 11:22] (current) – [Example dataset: Corpus PINO] change data sharing to data transfer agreement marlon | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| {{indexmenu_n> | {{indexmenu_n> | ||
| - | ====== Archiving and Publishing Human Subject Data ====== | + | ===== Archiving and Publishing Human Subject Data ===== |
| - | ===== Introduction | + | ==== Introduction ==== |
| Your project nears its completion. It is time to prepare your data for archiving and publishing in accordance with [[https:// | Your project nears its completion. It is time to prepare your data for archiving and publishing in accordance with [[https:// | ||
| - | ===== What needs to be archived and what can be published? | + | ==== What needs to be archived and what can be published? ==== |
| Check whether you can select data, with two goals of archiving in mind: | Check whether you can select data, with two goals of archiving in mind: | ||
| Line 11: | Line 11: | ||
| * Select and organize the data and other materials that are potentially valuable for further research by you, your team, or fellow researchers. | * Select and organize the data and other materials that are potentially valuable for further research by you, your team, or fellow researchers. | ||
| - | ===== De-identifying data before archiving or publishing ==== | + | ==== De-identifying data before archiving or publishing ==== |
| Often, it is not necessary to keep all collected data for the purpose of validating your findings or for researchers to reuse your data. | Often, it is not necessary to keep all collected data for the purpose of validating your findings or for researchers to reuse your data. | ||
| * Limit the (personal) data and materials you archive to the ones that you need for verification of your research. Follow the procedures in the [[datadesctruction|destruction protocol(s)]] that you designed. Add these protocol(s) to your data package, publication package or archive. (e.g. anonymised consent forms can be archived, while consent forms containing personal data should be de-identified or destroyed in accordance with the UG protocol) | * Limit the (personal) data and materials you archive to the ones that you need for verification of your research. Follow the procedures in the [[datadesctruction|destruction protocol(s)]] that you designed. Add these protocol(s) to your data package, publication package or archive. (e.g. anonymised consent forms can be archived, while consent forms containing personal data should be de-identified or destroyed in accordance with the UG protocol) | ||
| * Determine whether it is possible to [[de-identification|de-identify]] before publishing, while also keeping in mind the usability of your dataset. | * Determine whether it is possible to [[de-identification|de-identify]] before publishing, while also keeping in mind the usability of your dataset. | ||
| - | ===== Publishing | + | ==== Publishing |
| FAIR data does not necessarily mean that all your data and materials need to be openly available. Even after de-identification, | FAIR data does not necessarily mean that all your data and materials need to be openly available. Even after de-identification, | ||
| Consider applying a **‘layered’ approach** to your (de-identified) files by scoring your files in terms of sensitivity. | Consider applying a **‘layered’ approach** to your (de-identified) files by scoring your files in terms of sensitivity. | ||
| - | ====Level 1: contains no personal data ==== | + | === Level 1: contains no personal data === |
| Publish your [[de-identification|(anonymized)]] dataset and supporting materials in a recognized data repository such as [[https:// | Publish your [[de-identification|(anonymized)]] dataset and supporting materials in a recognized data repository such as [[https:// | ||
| - | ====Level 2: contains personal data in de-identified form (not anonymized)==== | + | === Level 2: contains personal data in de-identified form (not anonymized) === |
| Publish your [[de-identification|de-identified dataset]] and supporting materials on [[https:// | Publish your [[de-identification|de-identified dataset]] and supporting materials on [[https:// | ||
| - | ====Level 3: contains sensitive personal data ==== | + | === Level 3: contains sensitive personal data === |
| When your data still contains highly sensitive information, | When your data still contains highly sensitive information, | ||
| Line 35: | Line 38: | ||
| ==== Example dataset: Corpus PINO ==== | ==== Example dataset: Corpus PINO ==== | ||
| <color # | <color # | ||
| - | ---- | + | |
| //“Corpus PINO is a resource designed for research on different styles of spoken Italian and Neapolitan dialect. The corpus consists of anonymized audio recordings and ELAN time-aligned orthographic transcriptions involving fifty participants (stratified by age, gender, and education level). …. PINO is a contribution to the preservation of the local cultural heritage and of a minority language, i.e., an Italo-Romance dialect. It attests the lives, memories, opinions, traditions, practices, and attitudes of fifty members of this community.”// | //“Corpus PINO is a resource designed for research on different styles of spoken Italian and Neapolitan dialect. The corpus consists of anonymized audio recordings and ELAN time-aligned orthographic transcriptions involving fifty participants (stratified by age, gender, and education level). …. PINO is a contribution to the preservation of the local cultural heritage and of a minority language, i.e., an Italo-Romance dialect. It attests the lives, memories, opinions, traditions, practices, and attitudes of fifty members of this community.”// | ||
| ---- | ---- | ||
| Line 53: | Line 56: | ||
| ---- | ---- | ||
| - | === Data sharing | + | === Data transfer |
| - | When an external party requests level 3 or, in some cases, level 2 data, a data transfer agreement needs to be signed. A data transfer agreement is a legal contract that defines the specific purposes for which the data may be used by the requesting party. As such, it is the most comprehensive specification of terms of use. The data sharing | + | When an external party requests level 3 or, in some cases, level 2 data, a data transfer agreement needs to be signed. A data transfer agreement is a legal contract that defines the specific purposes for which the data may be used by the requesting party. As such, it is the most comprehensive specification of terms of use. The data transfer |
| <color # | <color # | ||