Running Whisper

Now that your general script is ready, run it by copying the command below into your terminal and pressing Enter:

The terminal will then confirm that your job has been received and assign it a jobID. The three messages below will appear in your terminal:

sbatch: Job sent to gpu partition
sbatch: Request for one of the default GPU types added
Submitted batch job <jobID>
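
If you would rather not copy the jobID by hand, it can be read from the last confirmation message. The snippet below is only a sketch: extract_job_id is a hypothetical helper name, and whisper_job.sh is a placeholder for your own batch script, not a file named in this guide.

```shell
# Hypothetical helper: print the jobID found in sbatch's confirmation messages.
# It reads the messages on standard input and looks for the line
# "Submitted batch job <jobID>", whose last word is the jobID.
extract_job_id() {
    awk '/Submitted batch job/ { print $NF }'
}

# Example usage (whisper_job.sh is a placeholder for your own batch script):
#   job_id=$(sbatch whisper_job.sh 2>&1 | extract_job_id)
#   echo "Whisper job submitted with jobID $job_id"
```

Storing the jobID in a variable this way makes it easy to reuse in later commands during the same terminal session.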

Please keep the jobID handy, as it is the quickest way to check how the job is progressing and whether it ran successfully. The figure below shows what the terminal will look like upon successfully launching the script.

Once the job is complete, you will find the transcribed audio in the output folder you specified in the batch script. 

Note: The script shown here will run Whisper on all audio files present in the folder whisper_audio. Make sure that folder contains only the files you wish to transcribe.
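
Before submitting, it can help to list what is actually in that folder. The sketch below assumes you run it from the directory that contains whisper_audio, and that only files directly inside the folder matter (whether the batch script also looks into subfolders is not stated here). list_audio_folder is a hypothetical helper name.

```shell
# Hypothetical helper: list the regular files directly inside a folder,
# sorted by name, so you can confirm what Whisper will be asked to transcribe.
# Subfolders and their contents are not listed.
list_audio_folder() {
    find "$1" -maxdepth 1 -type f | sort
}

# Example usage, run from the directory that contains whisper_audio:
#   list_audio_folder whisper_audio
```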

To check whether the HPC has finished your job, type the following, as shown in the figure below:

This will show all the information related to the job. Check two things: that the “State” parameter says “COMPLETED”, and that the “Reserved walltime” is greater than the “Used walltime”. A job that used up all of its reserved walltime was most likely stopped at the time limit before it could finish. If both conditions hold, the HPC is done processing your audio and the job ran correctly.
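
The two checks can also be written down as a small script. This is a sketch under assumptions: both helper names are hypothetical, you pass in the state and the two walltime values yourself, and the walltimes are assumed to be written as hours:minutes:seconds with an optional leading number of days (for example 1-00:00:00).

```shell
# Hypothetical helper: convert a walltime such as 02:30:00 or 1-00:00:00
# (days-hours:minutes:seconds) into a number of seconds.
walltime_to_seconds() {
    printf '%s\n' "$1" | awk -F'[-:]' '
        NF == 4 { print (($1 * 24 + $2) * 60 + $3) * 60 + $4 }
        NF == 3 { print ($1 * 60 + $2) * 60 + $3 }
        NF == 2 { print $1 * 60 + $2 }'
}

# Hypothetical helper: succeed only if the state is COMPLETED and the used
# walltime is strictly below the reserved walltime.
# Arguments: state, used walltime, reserved walltime.
job_ran_correctly() {
    [ "$1" = "COMPLETED" ] &&
        [ "$(walltime_to_seconds "$2")" -lt "$(walltime_to_seconds "$3")" ]
}

# Example usage with made-up values:
#   job_ran_correctly COMPLETED 00:42:10 02:00:00 && echo "Job ran correctly"
```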

If you have launched multiple jobs (which we do not recommend), or if you simply want to see whether the job is done without displaying its details, type the following:

This command displays all the active jobs linked to your account. In the list that appears, look for the jobID that you recorded and read the letters in the ST column: PD means the job is waiting for resources to become available, R means the job is running, and CG means the job is completing. If your job does not appear in this list, the HPC cluster is done processing your audio.
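
As a quick reference, the state codes above can be turned into plain words. The sketch below uses a hypothetical helper name, describe_job_state, and covers only the three codes mentioned in this guide. The usage line assumes a standard SLURM squeue, where -j selects a job, -h hides the header and -o %t prints only the short state code; 12345 is a made-up jobID.

```shell
# Hypothetical helper: translate the code in squeue's ST column into words.
# An empty argument stands for a job that no longer appears in the list.
describe_job_state() {
    case "$1" in
        PD) echo "waiting for resources" ;;
        R)  echo "running" ;;
        CG) echo "completing" ;;
        "") echo "no longer in the queue" ;;
        *)  echo "other state: $1" ;;
    esac
}

# Example usage (replace 12345 with your own jobID):
#   describe_job_state "$(squeue -j 12345 -h -o %t 2>/dev/null)"
```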

→ Move to the next step