
...

Automatic job preemption/suspend/resume?
Support for multiple levels of job preemption (e.g. 3-queue hierarchy)?
Job environment propagation (including limits like "stacksize")?
Subgroup-specific priority calculation (queue-specific priority formula)?
Capability to delegate subgroup administration privileges (adjust job priorities, suspend, resume, kill) to subgroup administrators?
Cross-queue fairshare (with CPU-speed weighting)?
CPU advanced reservations for MPI?
GPU support?
Ability to submit jobs to hosts where we don't have accounts/home-directories?
Does the system avoid bad behavior when an MPI head node reboots (e.g., slave node processes getting "forgotten")?
How well does the system scale?
- Number of cores, queues, queued and running jobs?

Please list supported operating systems (for submission hosts and for execution hosts).
Explicit resource specification using site-specific names (e.g., rhel5-64, amount of /scratch space, etc.) at job submission time?
Is there an API for "time remaining" query (to save state near end of job)?
Do submission / management hosts require a license?
- Ability to submit/monitor jobs from any machine at SLAC (not just those with licenses, as is the case with LSF)

...
