Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • we are trying to collect information about upcoming Fermi computing outages (disks, oracle, network) to improve planning
  • when planning an outage, please send an email to datalist and write the description here (including requested duration and preferred timeframe)
  • we will try to combine outages as much as possible, in order to maximize uptime for time-critical services (FASTCopy, pipeline, etc.)
  • once the plan is finalized, don't forget to send a message to glast-outage and the collaboration (if applicable)

Upcoming outage requests

Aug 16, 2012 - site wide power outage.

From John: everything except the servers on the generator will go down. Building 50 is supposed to be the first (or one of the first) buildings brought back up. Power goes off at 5:30 am 8/16. We could have power restrored by 6:30am. Bring up would begin after that, most services back in 2-4 hours. NOTE, however, we tentatively plan to start taking machines down at 17:30 the night before (Aug 15). So we are talking about a ~16 hour outage, if things go well.

July 11, 2012

  • 11:00am - 1:00pm: Replacing a bad fan on sulky34. Since that server holds the LAT raw data, FASTCopy ingestion will be stopped about an hour beforehand to let the pipeline drain.
    Also, the remaining databases will be migrated off of glastlnx01/02 onto mysql-node01.

...