Confluence will be unusable 23-July-2024 at 06:00 due to a Crowd upgrade.
...
The following is a simple submission script of a parallel psana batch job run with mpi. It can be submitted with the command "sbatch submit.sh
". The commands specified in the script file will be ran on the first available compute node that fits the resources requested. There are two ideas: "nodes" and "tasks per node". A "node" is a physical computer box (with a host-name, for example) but each box/node typically has multiple-cpu-cores (see this page for specific numbers: Batch System Analysis JobsNodes And Queues). Typically the tasks-per-node parameter is set to utilize all the cores on each node.
...
Code Block | ||
---|---|---|
| ||
> cat tst_script #!/bin/bash # #SBATCH --job-name=ânameâ<name> # Job name for allocation #SBATCH --output=âfilenameâ%j.log # File to which STDOUT will be written, %j inserts jobid #SBATCH --error=âfilenameâ%j.err # File to which STDERR will be written, %j inserts jobid #SBATCH --partition=anagpu # Partition/Queue to submit job #SBATCH --gres=gpu:1080ti:1 # Number of GPUs #SBATCH --ntask=8 # Total number of tasks #SBATCH --ntasks-per-node=4 # Number of tasks per node #SBATCH --mail-user='username'@slac.stanford.edu # Receive e-mail from slurm #SBATCH --mail-type=ALL # Type of e-mail from slurm; other options are: Error, Info. # srun -l hostname srun python ExampleMultipleChaperones.py > sbatch tst_script Submitted batch job 187 |
...