Nagios Probes for Argus (UMD)

Installation

Prerequisites

  • Python 2.4 or newer (not Python 3000)
  • UMD-3 or later (or EMI-3) repository installed
  • Host certificate & IGTF-bundle
  • Nagios server

Where to install

  • This set of probes needs to be installed on the Nagios server only. On the client side (where the Argus services actually run), no action is required.

Install instructions

There is available a rpm called “nagios-plugins-argus”, (nagios-plugins-argus - v. 1.1.0 (EMI-3). It can be installed as follows:

yum install nagios-plugins-argus

This installs the plugins into the /usr/libexec/grid-monitoring/probes/ directory. It is up to the user to bind them into the local Nagios environment.

NOTE: If you use iptables you need to open the ports on client side (where the Argus services are installed):

for IPv4 do:

iptables -I INPUT 1 -p tcp --dport 8154 -j ACCEPT
iptables -I INPUT 1 -p tcp --dport 8152 -j ACCEPT
iptables -I INPUT 1 -p tcp --dport 8150 -j ACCEPT
/etc/init.d/iptables save
/etc/init.d/iptables restart

for IPv6 do:

ip6tables -I INPUT 1 -p tcp --dport 8154 -j ACCEPT
ip6tables -I INPUT 1 -p tcp --dport 8152 -j ACCEPT
ip6tables -I INPUT 1 -p tcp --dport 8150 -j ACCEPT
/etc/init.d/ip6tables save
/etc/init.d/ip6tables restart

Overview of probes

There are 9 different probes available, three for each service:

Service Probe-Name
PAP nagios-plugins-argus.PAP.memory
^ nagios-plugins-argus.PAP.policies
^ nagios-plugins-argus.PAP.status
PDP nagios-plugins-argus.PDP.memory
^ nagios-plugins-argus.PDP.status
^ nagios-plugins-argus.PDP.traffic
PEPD nagios-plugins-argus.PEP.memory
^ nagios-plugins-argus.PEP.status
^ nagios-plugins-argus.PEP.traffic

PAP probes

nagios-plugins-argus.PAP.status

Checks if the service is available. This is done by a call to https://`hostname`:8150/pap/status, if there is a valid response the service is considered up and running.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

nagios-plugins-argus.PAP.memory

The response of the call executed by the status probe is parsed. The value for the current memory consumption is read out and displayed in the Status information field of the Nagios control panel and written into the Performance Data tab of Nagios. Hence it can be displayed as a graph over time by Nagios add-ons, e.g. pnp.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –warning MEM_WARN Memory usage warning threshold in MB 224.0
  –critical MEM_CRIT Memory usage critical threshold in MB 256.0
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

nagios-plugins-argus.PAP.policies

The response of the call executed by the status probe is parsed. The value for the current number of policies in the monitored PAP is read out and displayed in the Status information field of the Nagios control panel and written into the Performance Data tab of Nagios. Hence it can be displayed as a graph over time by Nagios add-ons, e.g. pnp.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

PDP probes

nagios-plugins-argus.PDP.status

Checks if the service is available. This is done by a call to https://`hostname`:8152/status, if there is a valid response the service is considered up and running.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

nagios-plugins-argus.PDP.memory

The response of the call executed by the status probe is parsed. The value for the current memory consumption is read out and displayed in the Status information field of the Nagios control panel and written into the Performance Data tab of Nagios. Hence it can be displayed as a graph over time by Nagios add-ons, e.g. pnp.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –warning MEM_WARN Memory usage warning threshold in MB 224.0
  –critical MEM_CRIT Memory usage critical threshold in MB 256.0
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

nagios-plugins-argus.PDP.traffic

The response of the call executed by the status probe is parsed. The value for the current number of requests executed and the number how many of them have been successful is read out and compared to the same numbers of the former test executed. Out of this the requests per second, the completed requests per second and the erroneous requests per second are computed and written into the Performance Data tab of Nagios. Hence it can be displayed as a graph over time by Nagios add-ons, e.g. pnp. Additionally the number of requests since the last test is written into the Status information field of the Nagios control panel.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –tempdir=TEMP_DIR Storage path for the needed temporary file ../../../../var/lib/grid-monitoring/nagios-plugins-argus
  –tempfile=TEMP_FILE Name for the needed temporary file hostname.nagios-plugins-argus.PDP.traffic.pickle
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

PEP Server probes

nagios-plugins-argus.PEP.status

Checks if the service is available. This is done by a call to https://`hostname`:8154/status, if there is a valid response the service is considered up and running.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –warning MEM_WARN Memory usage warning threshold in MB 224.0
  –critical MEM_CRIT Memory usage critical threshold in MB 256.0
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

nagios-plugins-argus.PEP.memory

The response of the call executed by the status probe is parsed. The value for the current memory consumption is read out and displayed in the Status information field of the Nagios control panel and written into the Performance Data tab of Nagios. Hence it can be displayed as a graph over time by Nagios add-ons, e.g. pnp.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –warning MEM_WARN Memory usage warning threshold in MB 224.0
  –critical MEM_CRIT Memory usage critical threshold in MB 256.0
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False

nagios-plugins-argus.PEP.traffic

The response of the call executed by the status probe is parsed. The value for the current number of requests executed and the number how many of them have been successful is read out and compared to the same numbers of the former test executed. Out of this the requests per second, the completed requests per second and the erroneous requests per second are computed and written into the Performance Data tab of Nagios. Hence it can be displayed as a graph over time by Nagios add-ons, e.g. pnp. Additionally the number of requests since the last test is written into the Status information field of the Nagios control panel.

Options
Mandatory CLI option Description Default
YES –hostname=HOSTNAME The hostname of the service  
  –cert CERT The SSL client certificate /etc/grid-security/hostcert.pem
  –key KEY The private key (the key must be unencrypted) /etc/grid-security/hostkey.pem
  –capath CAPATH The directory where trust anchors are stored on the system /etc/grid-security/certificates
  –tempdir=TEMP_DIR Storage path for the needed temporary file ../../../../var/lib/grid-monitoring/nagios-plugins-argus
  –tempfile=TEMP_FILE Name for the needed temporary file hostname.nagios-plugins-argus.PDP.traffic.pickle
  –help show this help message and exit  
  –port PORT The port of the service 8152
  –url URL The status endpoint URL of the service, e.g. https://hostname:port/status  
  –version show program’s version number and exit  
  –timeout TIMEOUT The TCP timeout for the HTTPS connection in seconds 20
  –verbose verbose mode False