Hopefully in Ballam, and via readytalk:
Call in details:
Dial Toll-Free Number: 866-740-1260 (U.S. & Canada)
International participants dial: Toll Number: 303-248-0285
Or International Toll-Free Number:http://www.readytalk.com/intl
Enter your 7-digit access code:
9542853
followed by “#”
Discussion Items:
We need to have a meeting, perhaps on monday, to review the status and lessons learned, and how to get (back) to a "production" level. Some topics:
- Are there problems with the way we have our VMs installed/configured
- Do we have things distributed among VMs in an optimal way (I think we have too much on scalnx-v01 for example)
- Is our documentation on what is running where complete and correct?
- Do we have documentation that would allow people at BNL to diagnose and fix some problems?
- Do we have nagios configured optimally
- Do we need servers-monitoring set up for non-Fermi machine
- Do we need to put things like login, group manager, etc under CCB
- Database monitoring
Action Items:
Update Server Locations and Functions page Brian Van Klaveren
Check that Nagios is monitoring all servers and web applications Charlotte Hee
Set up server monitoring page for non-Fermi servers Massimiliano Turri
Move web applications back to HA machine lsstlnx-v01
- Check that Ganglia is monitoring virtual and physical servers Charlotte Hee
- If they have been monitored: check plots on load for physical machine
- Make sure scalnx-v01 has production only processes
- Follow up with Arash on database monitoring, backups, logs, configurations
- Follow up with Yemi on VM problems