On July 19, 2013, at about 8:26 AM, Computing Division staff detected a loss of cooled water to Building 50. As temperatures rose in the machine room, Scientific Computing Services staff responded quickly, powering down about 700 batch servers at around 8:45 AM. Services were restored by about 11:15 AM. Shutting down the servers helped prevent problems that might otherwise have developed with components inside the systems.

Scientific Computing Services upgraded the batch RTM (Real Time Monitoring) utility to the latest version. The upgraded RTM works with the current production version of LSF (Load Sharing Facility) and will continue to work when LSF is upgraded to version 9.1. RTM provides scientific computing customers with a visual representation of the state of the batch queues.