Tuesday, July 19, 2011

A few points about SNMP monitoring

Monitoring happens at two levels:
1) SWCTRL is now health checked like any other process: it is part of the SW-WATCH-HEALTH-CHECK condition.
2) The SNMP agent itself is checked by WATCH through the SW-SNMP condition.

Concerning the SNMP agent, the nominal behavior is the following. The step numbers give the overall order of the exchange across both sides.

WATCH side:
1) Sends an empty datagram to the SNMP agent to wake it up
3) Reads the healthcheck socket port from the named pipe /tmp/swagent.watch.SHM (timeout of 1 ms)
4) Sends a datagram on the healthcheck socket
7) Reads the response from the SNMP agent on the healthcheck socket (timeout of 1 ms)

SNMP side:
2) If this is the first time, or if (WATCH health cycle – 1) has been reached, opens the healthcheck socket and writes its port to the named pipe /tmp/swagent.watch.SHM
5) Reads the datagram received on the healthcheck socket (timeout of 1 ms)
6) Sends a datagram back on the healthcheck socket
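
The seven steps above can be sketched in Python as follows. This is only an illustration of the exchange under stated assumptions, not the actual WATCH or SNMP agent code: the function names (agent_once, watch_check, demo), the "ping" payload, the echo reply, and the use of UDP on 127.0.0.1 are all hypothetical. The sketch always opens a fresh healthcheck socket and does not model the "first time or (WATCH health cycle – 1)" condition. The demo creates the named pipe in a temporary directory rather than under /tmp, and uses a timeout much longer than 1 ms because Python threads cannot be expected to meet that deadline. Unix only, since it relies on os.mkfifo.

```python
import os
import select
import socket
import tempfile
import threading

PIPE_NAME = "swagent.watch.SHM"  # file name from the post; the demo puts it in a temp dir
TIMEOUT_S = 0.001                # the 1 ms timeout quoted in the steps


def agent_once(wake_sock, pipe_path, timeout):
    """SNMP side (steps 2, 5, 6) for a single health check."""
    wake_sock.recvfrom(64)                      # woken up by WATCH's empty datagram (step 1)
    hc = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        hc.bind(("127.0.0.1", 0))               # step 2: open the healthcheck socket
        with open(pipe_path, "w") as pipe:      # blocks until WATCH has the pipe open
            pipe.write(f"{hc.getsockname()[1]}\n")
        hc.settimeout(timeout)
        data, peer = hc.recvfrom(64)            # step 5
        hc.sendto(data, peer)                   # step 6 (echoing the payload is an assumption)
    except OSError:
        pass                                    # timeout or socket error: WATCH sees a failure
    finally:
        hc.close()


def watch_check(wake_addr, pipe_path, timeout=TIMEOUT_S):
    """WATCH side (steps 1, 3, 4, 7). Returns True if the agent answered in time."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    # Open the pipe non-blocking so that the read in step 3 can honour a timeout.
    fd = os.open(pipe_path, os.O_RDONLY | os.O_NONBLOCK)
    try:
        sock.sendto(b"", wake_addr)                             # step 1
        readable, _, _ = select.select([fd], [], [], timeout)   # step 3
        if not readable:
            return False
        port = int(os.read(fd, 16).decode().strip())
        sock.sendto(b"ping", ("127.0.0.1", port))               # step 4 (hypothetical payload)
        sock.settimeout(timeout)
        reply, _ = sock.recvfrom(64)                            # step 7
        return reply == b"ping"
    except (OSError, ValueError):
        return False                            # timeout, socket error, or unreadable port
    finally:
        os.close(fd)
        sock.close()


def demo(timeout=1.0):
    """Run one health check between an agent thread and the WATCH side."""
    with tempfile.TemporaryDirectory() as tmp:
        pipe_path = os.path.join(tmp, PIPE_NAME)
        os.mkfifo(pipe_path)
        wake = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
        wake.bind(("127.0.0.1", 0))
        agent = threading.Thread(
            target=agent_once, args=(wake, pipe_path, timeout), daemon=True
        )
        agent.start()
        try:
            return watch_check(wake.getsockname(), pipe_path, timeout)
        finally:
            agent.join(timeout)
            wake.close()
```

Calling demo() runs one complete check and returns whether the agent replied. Calling watch_check against a pipe that nobody writes to returns False once the timeout expires, which corresponds to the failure case WATCH has to detect.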