Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

LCLS has 2 WEKA storage clusters: i) provides the home directories and software for all users and staff, and ii) the FFB  for the user's experiments.  There was a bug with the previous release of WEKA affecting quota. The latest version was installed to fix the quota bug and improve performance.  The team is still working with the WEKA team to deliver a feature for users to check their current quota usage.

...

There have been many incidents where the DSS and MON nodes fell into disrepair during an experiment and communication about their state as well as a checklist for the correct required setup was missing.  To provide transparency across the multiple teams, the IT team has created a confluence page that explains the validation methodology and provides a log of the current functional DSS and MON servers.  This will prevent the teams from guessing which nodes are available and allow the IT team to fix those that are not available.  Coming soon, the IT team will deploy a new image with monitoring that will provide alerts so that actions can be taken in a timely manner.

...