Differences
This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
| dcc:pdpsol:dataminimization [2026/06/04 13:53] – Add feedback Harrie (ethics) marlon | dcc:pdpsol:dataminimization [2026/06/11 08:47] (current) – alba | ||
|---|---|---|---|
| Line 72: | Line 72: | ||
| ===Metadata=== | ===Metadata=== | ||
| Photo, video or audio files might contain a timestamp, date and, depending on the equipment and settings, also location. Check whether you can prevent the collection of these data or remove these metadata as soon as possible after collection. [[https:// | Photo, video or audio files might contain a timestamp, date and, depending on the equipment and settings, also location. Check whether you can prevent the collection of these data or remove these metadata as soon as possible after collection. [[https:// | ||
| + | |||
| + | === Digital traces === | ||
| + | Be aware that bringing a device to an interview can, by itself, generate digital traces. If your phone is on, it may record GPS coordinates, | ||
| + | |||
| + | If you plan on doing interviews with participants, | ||
| ---- | ---- | ||
| Line 103: | Line 108: | ||
| As a researcher, you can reduce the amount of personal data you collect when conducting social media research by carefully selecting your data collection method. Here are two common research approaches, with practical tips for each: | As a researcher, you can reduce the amount of personal data you collect when conducting social media research by carefully selecting your data collection method. Here are two common research approaches, with practical tips for each: | ||
| * **Social media data scraping** is the automated collection of user-generated content and metadata from platforms like X (Formerly Twitter) and YouTube for systematic analysis. Make sure you limit the variables you collect during scraping and define clear filters to your range (e.g. keywords and date range). Consider taking a sample and not scraping all the data that falls within this range. | * **Social media data scraping** is the automated collection of user-generated content and metadata from platforms like X (Formerly Twitter) and YouTube for systematic analysis. Make sure you limit the variables you collect during scraping and define clear filters to your range (e.g. keywords and date range). Consider taking a sample and not scraping all the data that falls within this range. | ||
| - | * **Manual data collection and observation** make it possible to carefully design your data collection and easily prevent the collection of identifiable data. You can determine what data you collect and are less dependent on API. Examples of good practices: 1) Make sure not to collect any usernames, or store them separately from the rest of your data ([[pseudonymization|pseudonymization]]). 2) [[de-identification|De-identify]] other personal identifiable information that is not necessary for your research purpose during data collection. | + | * **Manual data collection and observation** make it possible to carefully design your data collection and easily prevent the collection of identifiable data. You can determine what data you collect and are less dependent on API. Examples of good practices: 1) Make sure not to collect any usernames or store them separately from the rest of your data ([[pseudonymization|pseudonymization]]). 2) [[de-identification|De-identify]] other personal identifiable information that is not necessary for your research purpose during data collection. |