Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Note

Note that the ability to perform general science analysis at SLAC by the LAT collaboration will be seriously hindered by this outage due to the fact that much of the batch farm will be unavailable.

Action
DateTimeEquipment *Action
A day or two prior to 20 Dec 2019TBATest of power source switching (i.e., normal line power to generator)
Fri 20 Dec 2019TBA switch to generator power (this could happen earlier) This will require a several-hour outage
Mon 6 Jan 2020 TBA return to normal power. This will require a several-hour outage

...

Category†serverVM/servicefunction
XC

fermi-gpfs01

fermi-gpfs02

fermi-gpfs05

fermi-gpfs06

fermi-gpfs07

fermi-gpfs08

xrootdxrootd server and storage
XC/HAfermi-vmclust01/02/03/04fermilnx-v02xrootd redirector
XC/HAfermi-vmclust01/02/03/04fermilnx-v12xrootd redirector
XC

fermi-gpfs03

fermi-gpfs04

GPFSFermi NFS/GPFS storage
XC

fermi-cnfs01

fermi-cnfs02

GPFS/NFS bridgeFermi NFS storage access
HA

staas-gpfs50

staas-gpfs51

 Critical ISOC NFS storage
HAfermilnx01 LAT config, fastcopy and real-time telemetry
HAfermilnx02 LAT config, fastcopy and real-time telemetry
XC/HAfermi-vmclust01/02/03/04fermilnx-v03archiver
HAfermi-oracle03 oracle primary
XCfermi-oracle04 oracle secondary
HA

mysql05

mysql06

mysql-node03calibration, etc. DB
XC400 cores (50 "hequ" equivalents) batch hosts for LISOC
queues={express,short,medium,long,glastdataq}
users={glast,lsstsim,lsstprod,glastmc,glastraw}
XC200 cores
 (25 "hequ" equivalents) batch hosts for Science Pipelines
XC/HAfermi-vmclust01/02/03/04fermilnx-v07/tomcat01Commons, Group manager
XC/HAfermi-vmclust01/02/03/04fermilnx-v16/tomcat06rm2
XC/HAfermi-vmclust01/02/03/04fermilnx-v05/tomcat08dataCatalog
XC/HAfermi-vmclust01/02/03/04fermilnx-v17/tomcat09Pipeline-II
XC/HAfermi-vmclust01/02/03/04fermilnx-v15/pipeline-mail01Pipeline-II email server
XC/HAfermi-vmclust01/02/03/04fermilnx-v18/tomcat10FCWebView, ISOCLogging, MPWebView
TelemetryMonitor, TelemetryTableWebUI
XC/HAfermi-vmclust01/02/03/04fermilnx-v10/tomcat11DataProcessing
XC/HAfermi-vmclust01/02/03/04fermilnx-v11/tomcat12TelemetryTrending
NC(non-Fermi server)astore-new (HPSS)FastCopy data archive
**We have been granted a temporary quota increase of 1 TB on /nfs/farm/g/glast/u23, which has allowed this item to become "NC"**
HA(non-Fermi server)trscrontokenized cron
HA(non-Fermi server)lnxcroncron
XC(non-Fermi server)(farm manager, etc.)LSF management
HAyfs01/NN (non-Fermi) basically all of AFS
HA(non-Fermi server)JIRAissue tracking (HA as of 10/20/2017)
XCrhel6-64 public login nodes (a small number is needed for interactive access)

† Equipment categories

Category
Machine status
NCnon-critical for entire 16-day shutdown period
XCexperiment critical but not in H.A. rack, only a few, short outages acceptable
HAhigh-availability (continuous operation)

...

Machine TypeTotalNotes
GPFS servers8 
NFS/GPFS bridge2 
VMware hypervisors2Not needed if all Fermi services can be moved to the two H.A. hypervisors
batch nodes ("hequ" equivalents)75Depending on which batch nodes are selected, some may already be in H.A. power
Oracle servers1There is rumor that this machine may already be on H.A. power – to be confirmed
Public login nodesN(where "N" is a small integer)
TOTAL88+N 

(red star) Note that HPSS is NOT required by Fermi.

...