Differences
This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
| dcc:itsol:whisper:datamanage [2025/09/10 13:44] – Minor text changes alba | dcc:itsol:whisper:datamanage [2025/10/30 13:56] (current) – adjusted index numbering giulio | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | {{indexmenu_n> | + | {{indexmenu_n> |
| ===== Data management safety measures ===== | ===== Data management safety measures ===== | ||
| Line 6: | Line 6: | ||
| There are **three main areas** that you need to clear to secure your data: | There are **three main areas** that you need to clear to secure your data: | ||
| * The **audio** that you provided on **input** | * The **audio** that you provided on **input** | ||
| - | * The **transcripts** that Whisper created | + | * The **transcripts** that Whisper created |
| * The **SLURM file** created by HPC to record the job details | * The **SLURM file** created by HPC to record the job details | ||
| Line 15: | Line 15: | ||
| ==== Input Audio ==== | ==== Input Audio ==== | ||
| - | The files contained in the folder '' | + | The files contained in the folder '' |
| Before removing the audio files, we advise you to first check if the transcripts are acceptable. Should you have to run the transcription again with a modified script (i.e., to force a language that Whisper did not automatically identify), then having the audio still on HPC will save you time. | Before removing the audio files, we advise you to first check if the transcripts are acceptable. Should you have to run the transcription again with a modified script (i.e., to force a language that Whisper did not automatically identify), then having the audio still on HPC will save you time. | ||
| Line 23: | Line 23: | ||
| ==== Output Text ==== | ==== Output Text ==== | ||
| - | The files contained in the folder '' | + | The files contained in the folder '' |
| The transcripts created by Whisper come in five different formats: | The transcripts created by Whisper come in five different formats: | ||
| Line 34: | Line 34: | ||
| ==== Job Information File ==== | ==== Job Information File ==== | ||
| - | Finally, there is one last file that needs to be removed before you are done cleaning your HPC environment. In your HOME folder | + | Finally, there is one last file that needs to be removed before you are done cleaning your HPC environment. In your '' |
| * '' | * '' | ||
| - | This file is created by HPC when you launch a job, and it is tagged with the '' | + | This file is created by HPC when you launch a job, and it is tagged with the '' |
| **Note**: If you were curious, SLURM stands for //Simple Linux Utility for Resource Management// | **Note**: If you were curious, SLURM stands for //Simple Linux Utility for Resource Management// | ||
| [[dcc: | [[dcc: | ||