Summary
https://nagios02nagios.slac.stanford.edu/
What is Nagios?
Nagios is an open-source monitoring tool. It is used at SLAC to automatically watch key hosts and services, and to contact appropriate personnel when/if they these services go down.
The primary SLAC Nagios instance is run by SCS. The web interface is available at https://nagios02nagios.slac.stanford.edu/.
How do I use this service?
To view https://nagios.slac.stanford.edu/ you must authenticate with a SLAC-based Unix or Windows account and password, when prompted by a webauth dialogue.
(Put link for how to request a slac-based (*nagios*) account here.)
Please contact unix-admin@slac.stanford.edu to discuss adding your hosts and services to the central SLAC Nagios service; to adjust existing checks; or to request that you be included in alert notifications for a specific host/service.
...
This generally works as so:
No Format |
---|
# Acknowledge an alert; stop sending emails
remctl nagios02.slac.stanford.edu nagios ack host HOSTNAME COMMENT
remctl nagios02.slac.stanford.edu nagios ack service HOSTNAME SERVICENAME COMMENT
# Pre-emptively mark a host/service as down, don't contact for a while
remctl nagios02.slac.stanford.edu nagios downtime host HOSTNAME HOURS COMMENT
remctl nagios02.slac.stanford.edu nagios downtime service HOSTNAME SERVICENAME HOURS COMMENT
# Tell nagios to run the check for this host/service in MINUTES minutes
remctl nagios02.slac.stanford.edu nagios schedule host HOSTNAME MINUTES COMMENT
remctl nagios02.slac.stanford.edu nagios schedule service HOSTNAME SERVICENAME MINUTES COMMENT
# Help documents and man pages
remctl nagios02.slac.stanford.edu nagios help
remctl nagios02.slac.stanford.edu nagios man
|
...