Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
dcc:pdpsol:de-identification [2026/03/17 16:15] marlondcc:pdpsol:de-identification [2026/03/23 14:09] (current) – add go back to P&DP home page marlon
Line 88: Line 88:
 ==== Research specific de-identification techniques ====  ==== Research specific de-identification techniques ==== 
 === Video data === === Video data ===
 +Researchers use video to record real-world behavior, interactions, or experiments in detail, for example, tracking how people move, communicate, or perform tasks over time. It is important to de-identify this type of data, because videos can easily reveal faces, voices, or surroundings, and leaving those visible can reveal participants’ identities.
  
 ++++ Face and body masking |[[https://github.com/MaskAnyone/MaskAnyone|MaskAnyone]] is a de-identification toolbox for videos that allows you to remove personal identifiable information from videos, while at the same time preserving utility. It provides a variety of algorithms that allows you to de-identify or even anonymize videos (video & audio).  ++++ Face and body masking |[[https://github.com/MaskAnyone/MaskAnyone|MaskAnyone]] is a de-identification toolbox for videos that allows you to remove personal identifiable information from videos, while at the same time preserving utility. It provides a variety of algorithms that allows you to de-identify or even anonymize videos (video & audio). 
Line 103: Line 104:
  
 === Audio data === === Audio data ===
-In research, audio recordings are typically collected to capture exactly what participants say during interviews or focus groups. Audio data itself can contain identifying information: Participants may be recognizable from their voice by other people, and modern speech recognition technologies can also be used to identify participants. For this reason, audio data should be de-identified before further use or sharing. +Audio recordings are typically collected to capture exactly what participants say during interviews or focus groups, or to study voice patterns. Audio data itself can contain identifying information: Participants may be recognizable from their voice by other people, and modern speech recognition technologies can also be used to identify participants. For this reason, audio data should be de-identified before further use or sharing. 
  
 ++++ Transcription | ++++ Transcription |
Line 127: Line 128:
   * Network identifiers (e.g. IP addresses)   * Network identifiers (e.g. IP addresses)
   * Device or user IDs (e.g. serial numbers, or account IDs)   * Device or user IDs (e.g. serial numbers, or account IDs)
- 
 ++++  ++++ 
  
 +----
 +[[dcc:pdpsol:start | → Go back to the Privacy & Data protection home page]]