Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

SLURM is new job scheduling system for the LCLS batch compute systems it is replacing the current LSF system. Generic documentation about SLURM can be found in this Quick Start User Guide. Even shorter documentation, some of it specific to psana, can be found in this page.

LCLS specific information about the S3DF facility is here:  Running at S3DF.

General S3DF documentation is here: https://s3df.slac.stanford.edu

Some hardware details about the S3DF can be found here:  https://s3df.slac.stanford.edu/public/doc/#/batch-compute?id=clusters-amp-repos

Partitions

The partition/queue information can be provided by the sinfo command.

...

The following is a simple submission script of a parallel psana batch job run with mpi.  It can be submitted with the command "sbatch submit.sh".  The The commands specified in the script file will be ran on the first available compute node that fits the resources requested.  There are two ideas: "nodes" and "tasks per node".  A "node" is a physical computer box (with a host-name, for example) but each box/node typically has multiple-cpu-cores (see this page for specific numbers:  Batch Nodes And Queues).  Typically the tasks-per-node parameter is set to utilize all the cores on each node.

...