Differences

This shows you the differences between two versions of the page.

dcc:itsol:whisper [2025/09/09 12:39] – [Introduction] Removed VRW referrence giulio
dcc:itsol:whisper [2025/09/10 13:25] (current) – Small text edits alba
Line 10:
 ===== Introduction =====
  
-This guide takes you through the steps to set up a personal system of speech-to-text transcription on University of Groningen infrastructure (for UG staff and students) on the basis of the [[https://openai.com/research/whisper|OpenAI Whisper automatic speech recognition (ASR) model]] running on the [[https://iris.service.rug.nl/tas/public/ssp/content/detail/service?unid=0d51dd1aa44f4cdcb4949f1702d1829f|Hábrók High Performance Computing]] (HPC) cluster.
+This guide takes you through the steps to set up a personal system of speech-to-text transcription on the University of Groningen infrastructure (for UG staff and students) on the basis of the [[https://openai.com/research/whisper|OpenAI Whisper automatic speech recognition (ASR) model]] running on the [[https://iris.service.rug.nl/tas/public/ssp/content/detail/service?unid=0d51dd1aa44f4cdcb4949f1702d1829f|Hábrók High Performance Computing]] (HPC) cluster.
  
 The process of transcribing spoken audio to text is usually a very time consuming manual process. The UG offers a licensed version of [[https://www.audiotranskription.de/en/f4transkript/|F4 Transkript]] on the University Workplace as an aid for manual transcription, but doesn't offer automatic speech recognition software.
  
-This guide is offered by the DCC to help researchers process their research data as efficiently as possible, while optimizing data protection (keeping their audio files on UG storage instead of sending it to cloud services). For technical aspects, the service is supported by the Data Science and HPC team of the CIT. If you wish to read more on the detailed functionalities of Whisper, please refer to the [[https://github.com/openai/whisper|manual in their Git repository]].
+This guide is offered by the DCC to help researchers process their research data as efficiently as possible, while optimizing data protection (keeping their audio files on UG storage instead of sending them to cloud services). For technical aspects, the service is supported by the Data Science and HPC team of the CIT. If you wish to read more on the detailed functionalities of Whisper, please refer to the [[https://github.com/openai/whisper|manual in their Git repository]].
  
-Should you have any further questions on the use or initial set up of Whisper on Hábrók HPC, please contact the DCC at [[dcc@rug.nl|dcc@rug.nl]].
+Should you have any further questions on the use or initial setup of Whisper on Hábrók HPC, please contact the DCC at [[dcc@rug.nl|dcc@rug.nl]].
  
 [[dcc:itsol:whisper:setup| → Move to the next step]]