Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  •  Confirm current H.A. rack occupants.  spreadsheet from Christian Pama
    Old (2017) spreadsheet here (thanks Shirley!) 
  •  Confirm the VM-master for a given VM.  Use the 'node' command, e.g., $ node -whereis fermilnx-v12
  •  Confirm the tomcat <-> service associations.  Table here.
  •  Confirm the tomcat-VM associations in this table. Use the 'node' command, e.g., $ node -whereis glast-tomcat01

Info

NOTE: Fermi has four VMware hypervisors, each of which contain some number of VMs running Fermi services.  Two of these hypervisor machines are in the H.A. racks (fermi-vmclust03/04), while the others (fermi-vmclust01/02) are not.  At this writing there are no user-level tools to allow one to discover which VMs are running on which hypervisor machines.

Category†serverVM/servicefunction
XC

fermi-gpfs01

fermi-gpfs02

fermi-gpfs05

fermi-gpfs06

fermi-gpfs07

fermi-gpfs08

 xrootd server and storage
XCfermi-vmclust01/02/03/04fermilnx-v02xrootd redirector
XCfermi-vmclust01/02/03/04fermilnx-v12xrootd redirector
XC

fermi-gpfs03

fermi-gpfs04

GPFSFermi NFS/GPFS storage
XC

fermi-cnfs01

fermi-cnfs02

GPFS/NFS bridgeFermi NFS storage access
HA

staas-gpfs50

staas-gpfs51

 Critical ISOC NFS storage
HAfermilnx01 LAT config, fastcopy and real-time telemetry
HAfermilnx02 LAT config, fastcopy and real-time telemetry
XCfermi-vmclust01/02/03/04fermilnx-v03archiver
HAfermi-oracle03 oracle primary
XCfermi-oracle04 oracle secondary
HA

mysql05

mysql06

mysql-node03calibration, etc. DB
XC400 cores (25 "hequ" equivalents) batch hosts for LISOC
queues={express,short,medium,long,glastdataq}
users={glast,lsstsim,lsstprod,glastmc,glastraw}
XC200 cores
 (12.5 "hequ" equivalents) batch hosts for Science Pipelines
HAfermi-vmclust01/02/03/04fermilnx-v07/tomcat01Commons, Group manager
XCfermi-vmclust01/02/03/04fermilnx-v16/tomcat06rm2
XCfermi-vmclust01/02/03/04fermilnx-v05/tomcat08dataCatalog
XCfermi-vmclust01/02/03/04fermilnx-v17/tomcat09Pipeline-II
XCfermi-vmclust01/02/03/04fermilnx-v15/pipeline-mail01Pipeline-II email server
XCfermi-vmclust01/02/03/04fermilnx-v18/tomcat10FCWebView, ISOCLogging, MPWebView
TelemetryMonitor, TelemetryTableWebUI
XCfermi-vmclust01/02/03/04fermilnx-v10/tomcat11DataProcessing
XCfermi-vmclust01/02/03/04fermilnx-v11/tomcat12TelemetryTrending
NC(non-Fermi server)astore-new (HPSS)FastCopy data archive
**We have arranged a temporary quota increase of 1 TB on /nfs/farm/g/glast/u23, which has allowed this item to become "NC"**
HA(non-Fermi server)trscrontokenized cron
HA(non-Fermi server)lnxcroncron
XC(non-Fermi server)(farm manager, etc.)LSF management
HAyfs01/NN (non-Fermi) basically all of AFS
HA(non-Fermi server)JIRAissue tracking (HA as of 10/20/2017)
    

...