Differences

This shows you the differences between two versions of the page.

habrok:additional_information:course_material:exercises_solutions [2023/09/20 08:26] – [Exercise 8 (DRAFT) - Using local storage within a job] camarocico
habrok:additional_information:course_material:exercises_solutions [2024/11/26 10:29] (current) – Use dokuwiki wp links for wikipedia pedro
Line 4: Line 4:
 These exercises can be followed by anyone having a Hábrók account, as the required data is available to all users. They may therefore be useful if you need to get started with using the cluster, but are not able to follow the basic course in person.   These exercises can be followed by anyone having a Hábrók account, as the required data is available to all users. They may therefore be useful if you need to get started with using the cluster, but are not able to follow the basic course in person.  
  
-The accompanying [[https://wiki.hpc.rug.nl/_media/habrok/additional_information/habrok_course.pdf|course slides]] can be found on this wiki by clicking the link. More information on the cluster is also available on the wiki. +The accompanying {{ :habrok:additional_information:basic_habrok_course.pdf |course slides }} can be found on this wiki by clicking the link. More information on the cluster is also available on the wiki. 
  
 The hostname of the Hábrók cluster to use for the exercises is: ''login1.hb.hpc.rug.nl''. Using one of the other login nodes will work as well. The hostname of the Hábrók cluster to use for the exercises is: ''login1.hb.hpc.rug.nl''. Using one of the other login nodes will work as well.
Line 11: Line 11:
  
  
-The end goal of these exercises is to submit two jobs and study the+The end goal of these exercises is to submit three jobs and study the
 results. The first job will run some R code that generates an animated results. The first job will run some R code that generates an animated
 GIF file of the Mandelbrot set. The second job will run a Python script GIF file of the Mandelbrot set. The second job will run a Python script
 on climate data and generates both text output and a plot of temperature on climate data and generates both text output and a plot of temperature
-data of a city.+data of a city. The third job will train a neural-network-based rice  
 +classifier, which uses many small files as input.  
  
 In the first part of the exercises we are going to use the command-line In the first part of the exercises we are going to use the command-line
-to set up some directories and files for these two jobs. The directories+to set up some directories and files for these three jobs. The directories
 will contain all the input files (scripts, data) for the different jobs will contain all the input files (scripts, data) for the different jobs
 that we are going to submit. that we are going to submit.
  
 In the second part of the exercises we will write the job scripts for In the second part of the exercises we will write the job scripts for
-both jobs, actually submit them, and, finally, study the results of the +all three jobs, actually submit them, and, finally, study their results.
-jobs.+
  
 ===== Exercises for Part I ===== ===== Exercises for Part I =====
Line 66: Line 66:
 You can again just copy and paste the code on the command line: <code> You can again just copy and paste the code on the command line: <code>
 [username@login1 ~]$ ls /scratch/$USER/inputfiles [username@login1 ~]$ ls /scratch/$USER/inputfiles
-ex1_mandelbrot.R  ex2_inputfile.csv  ex2_script.py+dataset.tar.gz  ex1_mandelbrot.R  ex2_inputfile.csv  ex2_script.py  train.py
 [username@login1 ~]$ [username@login1 ~]$
 </code> </code>
Line 404: Line 404:
 </hidden> </hidden>
  
-==== Exercise 4 (DRAFT) - Command-line: set up a directory for training a Neural Network ====+==== Exercise 4 - Command-line: set up a directory for training a Neural Network ====
  
 === a.  Change back to the jobs directory === === a.  Change back to the jobs directory ===
Line 593: Line 593:
 <code> <code>
 [username@login1 username]$ ls [username@login1 username]$ ls
 +climate.csv  dataset.tar.gz
 [username@login1 username]$  [username@login1 username]$ 
 </code> </code>
Line 609: Line 610:
  
 ==== Exercise 6 - Using R within a job ==== ==== Exercise 6 - Using R within a job ====
 +
 +In this exercise we will generate an animated image file showing an iterative generation of the Mandelbrot fractal using some code in R. You can find more details on the Mandelbrot set [[wp>Mandelbrot_set|here]].
 +
 +The main purpose of the exercise is to learn how to submit R code using a job script to the cluster. Having a nice image as a result is a bonus.
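 +The general shape of such a job script can be sketched as follows. This is only an illustrative skeleton, not the solution: the resource values, the module name and the way the R script is started are assumptions, and the exercise below builds the real script step by step.

```shell
#!/bin/bash
#SBATCH --job-name=mandelbrot
#SBATCH --time=00:30:00
#SBATCH --mem=1GB
# (the values above are illustrative, not the exercise's)

# Load an R module; the exact module name/version on Hábrók may differ
module load R

# Run the R script non-interactively
Rscript ex1_mandelbrot.R
```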
  
 === a. Go to the job directory for the Mandelbrot job === === a. Go to the job directory for the Mandelbrot job ===
Line 683: Line 688:
   - Did you forget to include the Shebang! line at the top of the file? Better do it now, then.   - Did you forget to include the Shebang! line at the top of the file? Better do it now, then.
 <hidden solution> <hidden solution>
-You can write the jobscript with the text editor you prefer. Note that instructions for the batch scheduler have to be given in lines starting with ''#SBATCH''. You should not make typos in these lines, as they may either be ignored (if you get the ''#SBATCH'' wrong or will lead to errors. \\+You can write the jobscript with the text editor you prefer. Note that instructions for the batch scheduler have to be given in lines starting with ''#SBATCH''. You should not make typos in these lines, as they may either be ignored (if you get the ''#SBATCH'' wrong) or lead to errors. \\
  
 Here is the information you need to put in: Here is the information you need to put in:
Line 907: Line 912:
  
 When you open the file on your local computer you will see an animated version of the Mandelbrot fractal set. When you open the file on your local computer you will see an animated version of the Mandelbrot fractal set.
-Details on this calculation can be found at: https://en.wikipedia.org/wiki/Mandelbrot_set+Details on this calculation can be found [[wp>Mandelbrot_set|here]].
 </hidden> </hidden>
  
Line 973: Line 978:
  
 ==== Exercise 7 - Using Python within a job ==== ==== Exercise 7 - Using Python within a job ====
 +
 +In this exercise we will run a Python script that analyzes temperature data for cities around the world, stored in a CSV file. The result will be a graph showing the average temperature over a period of time for a city of your choosing.
  
 === a. Go to the climate job directory === === a. Go to the climate job directory ===
Line 1070: Line 1077:
 </hidden> </hidden>
  
-== d. Make sure you have a valid city name and submit the job ===  +=== d. Make sure you have a valid city name and submit the job ===  
 Make sure that you have replaced //CITYNAME// in script.py by a  Make sure that you have replaced //CITYNAME// in script.py by a 
 major city (see exercise 3h of the first part), and submit the job. major city (see exercise 3h of the first part), and submit the job.
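If you prefer to do the substitution from the command line instead of an editor, a ''sed'' one-liner works too. The sketch below is self-contained and operates on a stand-in file so it is safe to run anywhere; the file name ''/tmp/ex2_demo.py'' and the city "Amsterdam" are only examples — for the real exercise, apply the same ''sed'' command to the exercise's Python script with your own city.

```shell
# Create a stand-in file containing the placeholder (mimics the real script)
printf 'city = "CITYNAME"\n' > /tmp/ex2_demo.py
# Replace the placeholder in place (GNU sed, as found on the cluster)
sed -i 's/CITYNAME/Amsterdam/' /tmp/ex2_demo.py
# Show the result
cat /tmp/ex2_demo.py
```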
Line 1147: Line 1154:
 </hidden> </hidden>
  
-==== Exercise 8 (DRAFT) - Using local storage within a job ====+==== Exercise 8 - Using local storage within a job ==== 
 + 
 +In this exercise we will train a neural network to recognize the type of rice from a picture of a rice grain. The training data set consists of 14,000 pictures for each of 5 types of rice grain: Jasmine, Basmati, Arborio, Ipsala and Karacadag. In addition, the data set contains 1,000 pictures per type for testing the quality of the resulting neural network. 
 + 
 +The data set can be found at: https://www.kaggle.com/datasets/muratkokludataset/rice-image-dataset 
 + 
 +Here are a few sample images: 
 + 
 +|{{:habrok:additional_information:course_material:karacadag_10004_.jpg?nolink|}}|{{:habrok:additional_information:course_material:arborio_100_.jpg?nolink|}}|{{:habrok:additional_information:course_material:jasmine_10003_.jpg?nolink|}}| 
 +^ Karacadag ^ Arborio ^ Jasmine ^ 
 + 
 +The main purpose of the exercise is to show you how to handle data sets containing many small files, in this case 75,000. If you were to extract the data set on the /scratch file system, you would already notice that this is quite slow: /scratch handles many small files poorly, as it has been optimized for streaming large files. 
 + 
 +We will therefore not go into more detail about how to use the resulting neural network.
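 +The workaround that the steps below implement follows a general stage-in / compute / stage-out pattern: extract the archive once onto fast node-local storage, compute there, and pack the results into a single archive afterwards. A minimal sketch of the idea (the resource values are illustrative, and the exact ''train.py'' invocation and result paths are given in the exercise steps, not here):

```shell
#!/bin/bash
#SBATCH --job-name=rice_classifier
#SBATCH --time=01:00:00
# (the values above are illustrative, not the exercise's)

# Stage in: extract the many small files once onto fast node-local storage
mkdir -p "$TMPDIR/dataset"
tar xzf /scratch/$USER/dataset.tar.gz -C "$TMPDIR/dataset"

# Compute: train from local storage (placeholder; the exercise gives the
# exact command and arguments)
python train.py

# Stage out: pack all results into a single archive on /scratch
# (one possible form; the exercise's exact command may differ)
tar czf /scratch/$USER/results.tar.gz "$TMPDIR/results"
```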
  
 === a. Go to the rice_classifier job directory === === a. Go to the rice_classifier job directory ===
Line 1163: Line 1183:
 === b. Start editing a new file for the job script === === b. Start editing a new file for the job script ===
 Create a new and empty file in the editor that will be the job script for this Create a new and empty file in the editor that will be the job script for this
-exercise. Choose any filename you want (we will use ''rice_classifier.sh''+exercise. Choose any filename you want (we will use ''rice_classifier.sh'')
 <hidden solution> <hidden solution>
 Open up your preferred editor to work on a new file, just as in the previous exercise. Open up your preferred editor to work on a new file, just as in the previous exercise.
Line 1220: Line 1240:
 </code> </code>
   - Extract the compressed dataset to the right location on the local storage: <code>   - Extract the compressed dataset to the right location on the local storage: <code>
-tar xzvf /scratch/$USER/dataset.tar.gz -C $TMPDIR/dataset+tar xzf /scratch/$USER/dataset.tar.gz -C $TMPDIR/dataset
 </code> </code>
   - Run the training: <code>   - Run the training: <code>
Line 1262: Line 1282:
  
 # Extract the compressed data file to local storage # Extract the compressed data file to local storage
-tar xzvf /scratch/$USER/dataset.tar.gz -C $TMPDIR/dataset+tar xzf /scratch/$USER/dataset.tar.gz -C $TMPDIR/dataset 
 + 
 +echo Starting Python program
  
 # Train the classifier # Train the classifier
Line 1290: Line 1312:
  
 === f. Study the output file === === f. Study the output file ===
-Study the SLURM output file and solve any errors, if necessary.+Study the SLURM output file and solve any errors, if necessary. Note that TensorFlow gives several warnings about not being able to use the CUDA library for an NVIDIA GPU; these can be ignored.
 <hidden solution> <hidden solution>
 If everything went right no error messages should appear in the output file.  If everything went right no error messages should appear in the output file. 
Line 1300: Line 1322:
 and extract the archive<code> and extract the archive<code>
 tar xzvf results.tar.gz</code> tar xzvf results.tar.gz</code>
-This is generally now a good idea, since the results might be large / contain lots of files, but it is fine for this particular example.+This is generally not a good idea, since the results might be large / contain lots of files, but it is fine for this particular example.
  
-Copy the contents of the ''/tmp/results/plots/'' folder to your local computer and have a look at the plots therein.+Copy the contents of the ''tmp/results/plots/'' folder to your local computer and have a look at the plots therein.
 <hidden solution> <hidden solution>
 You can use the MobaXterm file browser for downloading the files to your desktop or laptop for inspection. For this you need to move the file browser to the rice_classifier job directory. Select the png files and click on the download button with the arrow pointing downwards. You can use the MobaXterm file browser for downloading the files to your desktop or laptop for inspection. For this you need to move the file browser to the rice_classifier job directory. Select the png files and click on the download button with the arrow pointing downwards.
 </hidden> </hidden>
  
 +There will be three plots. The first shows the accuracy over the training epochs for both the training and testing data sets. The second shows the loss function, which is another measure of the quality of the neural network. The third shows the confusion matrix, which counts how the images in the testing data set were labeled, comparing the predicted labels to the real labels. 
 ==== The End ==== ==== The End ====
  
 Congratulations! You have submitted your first jobs to the Hábrók cluster. With what you've learned you should be able to write your own job scripts and run calculations on the Hábrók cluster.  Congratulations! You have submitted your first jobs to the Hábrók cluster. With what you've learned you should be able to write your own job scripts and run calculations on the Hábrók cluster.