We want to improve the robustness and reliability of the batch compute system by applying tighter resource controls. The goal is to isolate jobs from each other and prevent them from consuming all the resources on a machine. LSF version 9.1.2 makes use of Linux Control Groups (AKA cgroups) to limit the CPU cores and memory that a job can use. These cgroup-based restrictions are currently not in our production LSF configuration. We want to understand the potential impact on users and get feedback from stakeholders. I have outlined some examples below using our test cluster.
