Summary
https://nagios02nagios.slac.stanford.edu/
What is Nagios?
Nagios is an open-source monitoring tool. It is used at SLAC to automatically watch key hosts and services, and to contact appropriate personnel when/if they these services go down.
The primary SLAC Nagios instance is run by SCS. The web interface is available at https://nagios02nagios.slac.stanford.edu/.
How do I use this service?
To view https://nagios.slac.stanford.edu/ you must authenticate with a SLAC-based Unix or Windows account and password, when prompted by a webauth dialogue.
(Put link for how to request a slac-based (*nagios*) account here.)
Please contact unix-admin@slac.stanford.edu to discuss adding your hosts and services to the central SLAC Nagios service; to adjust existing checks; or to request that you be included in alert notifications for a specific host/service.
...
Nagios for Users
Checks
...
This generally works as so:
No Format |
---|
# Acknowledge an alert; stop sending emails remctl -p 46157 nagios02.slac.stanford.edu nagios ack host HOSTNAME COMMENT remctl -p 46157 nagios02.slac.stanford.edu nagios ack service HOSTNAME SERVICENAME COMMENT # Pre-emptively mark a host/service as down, don't contact for a while remctl -p 46157 nagios02.slac.stanford.edu nagios downtime host HOSTNAME HOURS COMMENT remctl -p 46157 nagios02.slac.stanford.edu nagios downtime service HOSTNAME SERVICENAME HOURS COMMENT # Tell nagios to run the check for this host/service in MINUTES minutes remctl -p 46157 nagios02.slac.stanford.edu nagios schedule host HOSTNAME MINUTES COMMENT remctl -p 46157 nagios02.slac.stanford.edu nagios schedule service HOSTNAME SERVICENAME MINUTES COMMENT # Help documents and man pages remctl -p 46157 nagios02.slac.stanford.edu nagios help remctl -p 46157 nagios02.slac.stanford.edu nagios man |
...