Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
dcc:itsol:whisper:scripts [2024/08/07 13:22] – moved content from LibGuides to wiki giulio | dcc:itsol:whisper:scripts [2025/02/03 08:49] (current) – added note on changed dependencies giulio | ||
---|---|---|---|
Line 5: | Line 5: | ||
In order to run the script, you will first have to create it. Open your text editor of choice and copy the highlighted code below into the new file. Save the file with the name: | In order to run the script, you will first have to create it. Open your text editor of choice and copy the highlighted code below into the new file. Save the file with the name: | ||
+ | |||
+ | **Note:** The PyTorch module needed to install Whisper has changed due to an update on the dependencies of Whisper. The module displayed in the screenshots is the previous version. Please make sure to **use the version of the module you find in the text**. | ||
Line 14: | Line 16: | ||
''# | ''# | ||
- | ''# | + | ''# |
- | ''# | + | ''# |
- | ''# | + | ''# |
\\ | \\ | ||
- | '' | + | '' |
'' | '' | ||
- | '' | + | '' |
---- | ---- | ||
Line 32: | Line 34: | ||
---- | ---- | ||
- | |||
- | **You can now close the editor**. | ||
The example below uses the '' | The example below uses the '' | ||
Line 58: | Line 58: | ||
The next three lines specify certain parameters for the batch script: | The next three lines specify certain parameters for the batch script: | ||
- | * ''# | + | * ''# |
This line specifies the maximum time your job will run on the cluster. The format is '' | This line specifies the maximum time your job will run on the cluster. The format is '' | ||
- | * ''# | + | * ''# |
This line tells the cluster that the script is asking for 1 GPU to be allocated to this job. For Whisper, 1 GPU is more than enough to run the transcription, | This line tells the cluster that the script is asking for 1 GPU to be allocated to this job. For Whisper, 1 GPU is more than enough to run the transcription, | ||
- | * ''# | + | * ''# |
This line specifies the amount of Memory/RAM asked for this job. In the default case, the script asks for 16GB of RAM to be allocated. | This line specifies the amount of Memory/RAM asked for this job. In the default case, the script asks for 16GB of RAM to be allocated. | ||
Line 74: | Line 74: | ||
The next two lines make sure that the virtual environment and the dependencies that Whisper needs to run are correctly loaded: | The next two lines make sure that the virtual environment and the dependencies that Whisper needs to run are correctly loaded: | ||
- | * '' | + | * '' |
This line loads the program packages that Whisper needs to run. Please be sure to not modify it, otherwise the script is not going to load the correct dependencies. | This line loads the program packages that Whisper needs to run. Please be sure to not modify it, otherwise the script is not going to load the correct dependencies. | ||
Line 84: | Line 84: | ||
Finally, the last line is the actual command to run Whisper: | Finally, the last line is the actual command to run Whisper: | ||
- | * '' | + | * '' |
- | If you wish to modify the location of the input audio, then you need to specify its '' | + | If you wish to modify the location of the input audio, then you need to specify its '' |
++++ | ++++ | ||
+ | |||
+ | ==== Specialized scripts ==== | ||
+ | |||
+ | The script described above is a general use script. It relies on Whisper to make most of the decisions regarding the transcription. If you need to be more strict on what the program is allowed to do, you might want to use one of the scripts listed below. | ||
+ | |||
+ | It is good practice to create different scripts for different tasks, instead of modifying the same script based on your needs. In this way, you don't have to modify the script again, if you want to execute a task that you already created in the past. This practice helps you keep order and is less prone to errors. | ||
+ | |||
+ | \\ | ||
+ | |||
+ | === Forced English === | ||
+ | |||
+ | This script forces Whisper to transcribe the audio into English. Use this script if the automatic language detection results in the wrong language (i.e. a strong English accent being recognized as Welsh, instead of English). The same concept works for other supported languages, for example Dutch. To change which language is forced, simply substitute the string '' | ||
+ | |||
+ | When you save the script, you can call it '' | ||
+ | |||
+ | ++++ Click to display the script | | ||
+ | |||
+ | ''# | ||
+ | |||
+ | ''# | ||
+ | |||
+ | ''# | ||
+ | |||
+ | ''# | ||
+ | |||
+ | \\ | ||
+ | |||
+ | '' | ||
+ | |||
+ | '' | ||
+ | |||
+ | '' | ||
+ | |||
+ | ++++ | ||
+ | \\ | ||
+ | === Translate instead of transcribe === | ||
+ | |||
+ | Whisper is also capable of translating any X language into English. To let the program know that you wish to see a translation instead of a transcription, | ||
+ | |||
+ | When you save the script, you can call it '' | ||
+ | |||
+ | **Note**: Regardless of whether you run the transcription or the translation first, the file names of the output files will be the exact same. In order for the second operation (translation or transcription) to not overwrite the first, you need to rename the output files before you run the second operation. In this way, the output of your first operation will remain untouched by the second operation. | ||
+ | |||
+ | ++++ Click to display the script | | ||
+ | |||
+ | ''# | ||
+ | |||
+ | ''# | ||
+ | |||
+ | ''# | ||
+ | |||
+ | ''# | ||
+ | |||
+ | \\ | ||
+ | |||
+ | '' | ||
+ | |||
+ | '' | ||
+ | |||
+ | '' | ||
+ | |||
+ | ++++ | ||
+ | |||
+ | \\ | ||
[[dcc: | [[dcc: |