Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

No Format
/reg/common/package/matlab/r2012a/bin/matlab

Batch

...

Farms

There are a number of batch farms (i.e. collections of compute nodes) located in the NEH and FEH. Depending on your data access you may need to submit jobs to a specific farm, by specifying a specific queue. This can be accomplished by submitting to the appropriate LSF batch queue. Refer to the table below. Multi-core OpenMPI jobs should be run in either the psnehmpiq or psfehmpiq batch queue, see the following section on "Submitting OpenMPI Batch Jobs". Simulation jobs should be submitted to the low priority queues psfehidle and psfehidle.

Experimental Hall

Queue

Nodes

Data

Comments

NEH

psnehq

psana11xx,psana12xx

ana01, ana02

Jobs <= 6 cores

 

psnehmpiq

psana11xx,psana12xx

ana01, ana02

OpenMPI jobs > 6 cores, preemptable

 

psnehidle

psana11xx,psana12xx

 

Simulations, preemptable, low priority

FEH

psfehq

psana13xx,psana14xx

ana11, ana12

Jobs <= 6 cores

 

psfehmpiq

psana13xx,psana14xx

ana11, ana12

OpenMPI jobs > 6 cores, preemptable

 

psfehidle

psana13xx,psana14xx

 

Simulations, preemptable, low priority

The batch farms listed above consist of eighty nodes with the following general specifications:

  • 12 cores, Xeon X5675, 24GB memory, 500GB disk, QDR IB

LSF Overview

LSF (Load Sharing Facility) is a job scheduler provided by Platform Computing.  It is used at SLAC to execute user batch jobs on the various batch farms.  A short list of useful LSF commands follows:

Report status of ALL jobs (running, pending, finished, et cetera) submitted by the current user:

bjobs -w -a

Report only running or pending jobs submitted by user "radmer":

bjobs -w -u radmer

Report running or pending jobs for all users in the psnehq queue:

bjobs -w -u all -q psnehq

Report current node usage on the two NEH batch farms:

bhosts -w ps11farm ps12farm

The following links give more detailed LSF usage information:

 can You can find more LCLS specific information about LSF in this this PDF file file. For a more detailed description and more LSF commands, please see:

http://www.slac.stanford.edu/comp/unix/unix-hpc.html

The batch farm is made of eighty servers with the following general specifications:

...

Submitting Batch Jobs (link)

...