Site Monitoring Services
NOTE: Arvind needs to go over this RSN.
This topic discusses site-level service monitoring (e.g., of compute elements) whose results are collected and monitored centrally.
In OSG, the RSV system, as installed on OSG 1.2.32 compute elements (http://software.grid.iu.edu/osg-1.2), reports locally scheduled probe results to a central collector operated by the GOC. At present there is no web-visible monitor for this service, but GOC staff monitor the data upload into the collector. More information on RSV is available here.
A set of Nagios probe wrappers designed for RSV probe output is being prepared and should be available in an RSV community probe repository. The Nagios probes (wrappers) can be used in site-level Nagios consoles that may already be in use for local fabric monitoring (e.g., by defining an OSG service group), or their results may be reported to facility-wide Nagios monitors. At present there is no OSG-wide Nagios console, but some VOs run Nagios for their facilities and may find these probes useful. More information on this will be made available in the near future (EOT: Spring 2008).
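To illustrate what such a wrapper might do, here is a minimal Python sketch that maps the output of an RSV probe onto a Nagios plugin result. It is not the wrapper set described above. It assumes the probe prints "key: value" lines (such as metricName, metricStatus and summaryData) terminated by a line reading EOT; that record layout, the default name rsv-probe and the function names are assumptions made for this example. The exit codes 0 to 3 are the standard Nagios plugin return codes.

```python
# Standard Nagios plugin return codes.
NAGIOS_EXIT = {"OK": 0, "WARNING": 1, "CRITICAL": 2, "UNKNOWN": 3}


def parse_probe_output(text):
    """Parse 'key: value' lines into a dict, stopping at an EOT line.

    The record layout is an assumption for this sketch, not a
    confirmed description of RSV probe output.
    """
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if line == "EOT":
            break
        # Split on the first colon only, so values such as timestamps
        # that contain colons stay intact.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def to_nagios(text):
    """Return (exit_code, status_line) for one probe record."""
    fields = parse_probe_output(text)
    status = fields.get("metricStatus", "UNKNOWN").upper()
    if status not in NAGIOS_EXIT:
        # Anything unrecognised is reported as UNKNOWN rather than OK.
        status = "UNKNOWN"
    name = fields.get("metricName", "rsv-probe")
    summary = fields.get("summaryData", "no summary")
    return NAGIOS_EXIT[status], f"{status}: {name} - {summary}"
```

A wrapper script built on this would run the probe, pass its captured output to to_nagios, print the returned status line and exit with the returned code, which is how Nagios reads a plugin result. Treating missing or unrecognised statuses as UNKNOWN keeps a malformed probe record from being shown as healthy.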
Responsible: -- ArvindGopu - 06 Jun 2008
Reviewer - date:
Topic revision: r7 - 30 Dec 2009 - 18:20:21 - RobQ