[ZBX-17194] Trigger goes first OK and then recovers to OK, instead of PROBLEM and then OK
Created: 2020 Jan 17  Updated: 2024 Apr 10  Resolved: 2022 Jan 25
Status: Closed
Project: ZABBIX BUGS AND ISSUES
Component/s: Server (S)
Affects Version/s: 4.0.15, 5.0.10
Fix Version/s: None
Type: Problem report
Priority: Trivial
Reporter: Markku Leiniö
Assignee: Zabbix Development Team
Resolution: Won't fix
Votes: 1
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment: Debian 9 Stretch, 4 vCPU, 8 GB, 240 NVPS, database on a separate VM
Team:
Sprint: Sprint 77 (Jun 2021), Sprint 78 (Jul 2021), Sprint 79 (Aug 2021), Sprint 80 (Sep 2021), Sprint 81 (Oct 2021), Sprint 82 (Nov 2021), Sprint 83 (Dec 2021), Sprint 84 (Jan 2022)
Story Points: 0.1
Description
In some cases there is no PROBLEM event for a trigger, only an OK event followed by another OK event.

Configured action operations message:

Trigger: {EVENT.NAME}
Item values:
Original event ID: {EVENT.ID}

Configured action recovery operations message:

Trigger: {EVENT.NAME}
Item values:
Recovery event ID: {EVENT.RECOVERY.ID}

Trigger expression for the example below:

{corertr-1:ciscoBgpPeerState[10.22.44.50].last(0)}=6 and {corertr-1:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.44.50].last(0)}<0.8*{corertr-1:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.44.50].avg(86400)}

(meaning: trigger if the BGP peer state is established AND the accepted prefixes are less than 80% of the average prefix count during the last day)

Actual first message received:

Trigger: BGP peer 10.83.44.50 has lost more than 20% of prefixes
Item values:
Original event ID: 58648908

(Comment: This trigger is valid because the prefixes dropped from 4 to 1. But why is the state OK and not PROBLEM?)

Then the recovery message received:

Trigger: BGP peer 10.83.22.50 has lost more than 20% of prefixes
Item values:
Recovery event ID: 58648909

Observations:
Questions:
I have similar observations with other metrics as well (non-BGP related). This happens every now and then, but I don't have any other examples at hand right now.
Comments
Comment by Markku Leiniö [ 2020 Jan 17 ]

I have to give more information about this. These are the item values (5-minute interval):

2020-01-16 22:15:39    1

According to that data, the recovery information above was simply incorrect: at 22:10:39 there was only 1 prefix, even though the recovery event said "4 Prefix".

What I missed is that I also got a second alert at 22:15:39:
Trigger: BGP peer 10.22.44.50 has lost more than 20% of prefixes
Trigger status: PROBLEM
Trigger severity: Average
Host IP: 10.22.0.1
Hostname: corertr-1
Event time: 2020.01.16 at 22:15:39
Item values:
1. Operational status for peer 10.22.44.50 (corertr-1:ciscoBgpPeerState[10.22.44.50]): established (6)
2. Accepted prefixes for peer 10.22.44.50 (corertr-1:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.44.50]): 1 Prefix
3. Accepted prefixes for peer 10.22.44.50 (corertr-1:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.44.50]): 1 Prefix
Original event ID: 58649658
See: the PROBLEM state is there now, with 1 prefix (correct according to the item data). This recovered only at 22:55:39, when the item value got back to 4 (verified from the item values list as well); that part is OK:
Trigger: BGP peer 10.22.44.50 has lost more than 20% of prefixes
Trigger status: OK
Trigger severity: Average
Host IP: 10.22.0.1
Hostname: corertr-1
Event recovery time: 2020.01.16 at 22:55:39
Original event time: 2020.01.16 at 22:15:39
Item values:
1. Operational status for peer 10.22.44.50 (corertr-1:ciscoBgpPeerState[10.22.44.50]): established (6)
2. Accepted prefixes for peer 10.22.44.50 (corertr-1:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.44.50]): 4 Prefix
3. Accepted prefixes for peer 10.22.44.50 (corertr-1:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.44.50]): 4 Prefix
Recovery event ID: 58654409
Original event ID: 58649658
Any ideas what could have caused the first event pair (OK+OK), and why it "recovered" with incorrect item data in the action message at 22:10:39?
Comment by Markku Leiniö [ 2020 Jan 26 ]

Due to the upcoming MariaDB upgrade I decided to move to Zabbix 4.4.4 to prevent the problems described in https://support.zabbix.com/browse/ZBX-16757. I'll let you know if this OK+OK problem continues.
Comment by Markku Leiniö [ 2020 Jan 30 ]

The same "OK+OK" problem also occurs on 4.4.4:

Trigger: No prefixes received for peer 10.22.17.86
Trigger status: OK
Trigger severity: High
Host IP: 10.22.0.2
Hostname: corertr-2
Event time: 2020.01.30 at 06:40:38
Item values:
1. Operational status for peer 10.22.17.86 (corertr-2:ciscoBgpPeerState[10.22.17.86]): established (6)
2. Accepted prefixes for peer 10.22.17.86 (corertr-2:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.17.86]): 0 Prefix
Original event ID: 60901593

Trigger: No prefixes received for peer 10.22.17.86
Trigger status: OK
Trigger severity: High
Host IP: 10.22.0.2
Hostname: corertr-2
Event recovery time: 2020.01.30 at 06:40:38
Original event time: 2020.01.30 at 06:40:38
Item values:
1. Operational status for peer 10.22.17.86 (corertr-2:ciscoBgpPeerState[10.22.17.86]): idle (1)
2. Accepted prefixes for peer 10.22.17.86 (corertr-2:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.17.86]): 0 Prefix
Recovery event ID: 60901597
Original event ID: 60901593

Trigger expression for this was:

{corertr-2:ciscoBgpPeerState[10.22.17.86].last(0)}=6 and {corertr-2:ciscoVPNv4cbgpPeerPrefixAccepted[10.22.17.86].last(0)}=0

The values for the items were:

ciscoBgpPeerState:
2020-01-30 06:45:38  idle (1)
2020-01-30 06:40:38  idle (1)
2020-01-30 06:35:38  established (6)
2020-01-30 06:30:38  established (6)

ciscoVPNv4cbgpPeerPrefixAccepted:
2020-01-30 06:45:39  0
2020-01-30 06:40:38  0
2020-01-30 06:35:38  17
2020-01-30 06:30:38  17

Do note that the peer state value shown in the trigger activation message is incorrect: it shows established (6) even though the item values list shows idle (1) at 06:40:38.
Comment by Markku Leiniö [ 2021 Feb 22 ]

Btw, this is still a problem with 5.0.x.
Comment by Markku Leiniö [ 2021 Apr 06 ]

This is not an SNMP-specific problem; today I got similar OK+OK events from icmpping items, with 5.0.10.
Comment by Dmitry Krupornitsky [ 2021 May 17 ]

Hello, Markku. What is the current state of this issue for you? If it still occurs, something is wrong with the recovery option of the trigger. Let's focus on the icmpping case: could you please provide all the data you have on this issue?
Comment by Markku Leiniö [ 2021 May 17 ]

Hi, thanks for getting back on this. The state is that there is still occasionally a problem event that says OK. We are currently still running the 5.0.10 server.

The ping case on 2021-04-06, the action results:

Problem starts:

Trigger: Host is not pinging
Trigger status: OK
Trigger severity: High
Host IP: 8.8.8.8
Hostname: Google-DNS-from-DUB
Event time: 2021.04.06 at 06:44:47
Item values:
1. Ping (Google-DNS-from-DUB:icmpping[,3,500,68,1000]): Down (0)
2. *UNKNOWN* (*UNKNOWN*:*UNKNOWN*): *UNKNOWN*
3. *UNKNOWN* (*UNKNOWN*:*UNKNOWN*): *UNKNOWN*
Original event ID: 79482048

(Trigger status = {TRIGGER.STATUS})

Recovery:

Trigger: Host is not pinging
Trigger status: OK
Trigger severity: High
Host IP: 8.8.8.8
Hostname: Google-DNS-from-DUB
Event recovery time: 2021.04.06 at 06:44:57
Original event time: 2021.04.06 at 06:44:47
Item values:
1. Ping (Google-DNS-from-DUB:icmpping[,3,500,68,1000]): Up (1)
2. *UNKNOWN* (*UNKNOWN*:*UNKNOWN*): *UNKNOWN*
3. *UNKNOWN* (*UNKNOWN*:*UNKNOWN*): *UNKNOWN*
Recovery event ID: 79482051
Original event ID: 79482048

There was a total of three similar OK+OK ping events at the same time (within three seconds) for three different hosts, all monitored by the same active proxy.

Trigger expression:

{Google-DNS-from-DUB:icmpping[,{$I_PING_PACKETS:PING},{$I_PING_MSECINTERVAL:PING},{$I_PING_SIZE:PING},{$I_PING_TIMEOUT:PING}].max({$T_PING_PING_TIME})}=0

Macros:
{$I_PING_PACKETS} = 3
{$I_PING_MSECINTERVAL} = 500
{$I_PING_SIZE} = 68
{$I_PING_TIMEOUT} = 1000
{$T_PING_PING_TIME} = #2

OK event generation = Expression
PROBLEM event generation mode = Single

Logs from the server (5.0.10):

874:20210406:064351.375 sending configuration data to proxy "Proxy-aws-dub-1" at "10.x.x.x", datalen 11343
848:20210406:064527.580 SNMP agent item "ifOperStatus[tunnel.347]" on host "fw-1" failed: first network error, wait for 15 seconds
863:20210406:064546.008 SNMP agent item "ifOutUcastPkts[tunnel.123]" on host "fw-1" failed: another network error, wait for 15 seconds
863:20210406:064605.024 SNMP agent item "ifInUcastPkts[tunnel.157]" on host "fw-1" failed: another network error, wait for 15 seconds
863:20210406:064624.039 temporarily disabling SNMP agent checks on host "fw-1": host unavailable
863:20210406:064828.178 enabling SNMP agent checks on host "fw-1": host became available
876:20210406:065851.682 sending configuration data to proxy "Proxy-aws-dub-1" at "10.x.x.x", datalen 11343

Logs from the proxy (5.0.10):

149745:20210406:064351.398 received configuration data from server at "172.x.x.x", datalen 11343
149745:20210406:065851.705 received configuration data from server at "172.x.x.x", datalen 11343

What I conclude from the logs (and from the fact that three separate hosts suffered the same OK+OK case within a couple of seconds) is that there was a network/firewall issue between the Zabbix server and the proxy: according to the server logs, the SNMP agent checks (to the firewall that sits between the server and the proxy, and also between the proxy and the pinged hosts) were failing within a minute of the ping problems. Most of the OK+OK cases still come from SNMP monitoring done by the Zabbix server; this ping case was very exceptional.

Anyway: Zabbix detected the problems, but the TRIGGER.STATUS macro said "OK" for some reason.
Comment by Dmitry Krupornitsky [ 2021 May 17 ]

What about the time: is it all the same on all proxies/agents? Is it possible to get the history data for 2021.04.06 at 06:44:40 - 2021.04.06 at 06:48:00 for icmpping?
Comment by Markku Leiniö [ 2021 May 17 ]

Server time is NTP-synced on both the server and the proxy, to the local providers' official time sources, and by manual checks they look identical (= if there is a difference, it is below 0.5 seconds). Unfortunately there is no history data anymore for that event in April (according to the Latest data values list).
Comment by Dmitry Krupornitsky [ 2021 May 17 ]

Could you please update this ticket with actual data as soon as it occurs next time; I will leave the ticket open. We may also need the following:
1) history data before and after the event
2) the trigger expression and screenshots from the server config
3) logs from the server/agent around the event
4) any other info you may find informative
I suspect it is a quick change from the icmp fail state to the OK state, so the trigger comes to an inconsistent state, but if it is a bug, I can't clearly understand how it occurs.
Comment by Markku Leiniö [ 2021 May 19 ]
Many "OK+OK" events this morning, SNMP only. Here is one:

Start time: 2021-05-19 05:07:21
Recovery time: 2021-05-19 05:07:21
Host: site1-corertr-2
Trigger: BGP session with peer site2-corertr-1 (Inet) (10.1.22.18) has been restarted

Trigger expression:

{site1-corertr-2:BgpPeerEstablishedTime.VPN_A[10.1.22.18].change()}<0 and {site1-corertr-2:BgpPeerState.VPN_A[10.1.22.18].last(0)}=6

Items:
BgpPeerEstablishedTime.VPN_A[10.1.22.18] = SNMPv2 1.3.6.1.2.1.15.3.1.16.10.1.22.18, numeric (unsigned), units = uptime, update interval = 5m, no preprocessing
BgpPeerState.VPN_A[10.1.22.18] = SNMPv2 .1.3.6.1.2.1.15.3.1.2.10.1.22.18, numeric (unsigned), update interval = 5m, preprocessing: Discard unchanged with heartbeat = 6h

History for BgpPeerEstablishedTime.VPN_A[10.1.22.18]:
2021-05-19 05:17:20  535
2021-05-19 05:12:19  234
2021-05-19 05:07:21  0
2021-05-19 05:02:20  6643828
2021-05-19 04:57:20  6643528
2021-05-19 04:52:20  6643228

History for BgpPeerState.VPN_A[10.1.22.18]:
2021-05-19 05:27:16  established (6)
2021-05-19 05:22:20  idle (1)
2021-05-19 05:12:19  established (6)
2021-05-19 05:07:21  idle (1)
2021-05-19 01:22:20  established (6)
2021-05-18 19:17:21  established (6)

Zabbix server logs at the time of the event:

875:20210519:045828.590 sending configuration data to proxy "Proxy-aws-dub-1" at "10.3.99.5", datalen 11343
877:20210519:050043.818 sending configuration data to proxy "Proxy-site2-v6proxy-1" at "10.2.99.4", datalen 17043
851:20210519:050432.022 SNMP agent item "net.if.status[ifAdminStatus.Gi0]" on host "site2-inetrtr-1" failed: first network error, wait for 15 seconds
871:20210519:050451.724 SNMP agent item "net.if.status[ifAdminStatus.Gi0]" on host "site2-inetrtr-1" failed: another network error, wait for 15 seconds
878:20210519:050455.478 sending configuration data to proxy "Proxy-aws-sto-1" at "10.4.99.5", datalen 10385
866:20210519:050510.730 SNMP agent item "net.if.subif.status[ifHCOutOctets.Po10.802]" on host "site2-inetrtr-1" failed: another network error, wait for 15 seconds
871:20210519:050514.731 SNMP agent item "net.if.subif.status[ifOperStatus.Po10.802]" on host "site2-inetrtr-1" failed: another network error, wait for 15 seconds
866:20210519:050533.738 temporarily disabling SNMP agent checks on host "site2-inetrtr-1": host unavailable
864:20210519:050856.992 enabling SNMP agent checks on host "site2-inetrtr-1": host became available
855:20210519:050901.951 SNMP agent item "net.if.subif.status[ifOperStatus.Po10.800]" on host "site2-inetrtr-1" failed: first network error, wait for 15 seconds
865:20210519:050919.234 resuming SNMP agent checks on host "site2-inetrtr-1": connection restored
865:20210519:050926.424 SNMP agent item "net.if.status[ifOperStatus.Gi0/0/1]" on host "site2-inetrtr-1" failed: first network error, wait for 15 seconds
856:20210519:050943.641 resuming SNMP agent checks on host "site2-inetrtr-1": connection restored
878:20210519:051328.884 sending configuration data to proxy "Proxy-aws-dub-1" at "10.3.99.5", datalen 11343
875:20210519:051543.903 sending configuration data to proxy "Proxy-site2-v6proxy-1" at "10.2.99.4", datalen 17043
876:20210519:051955.644 sending configuration data to proxy "Proxy-aws-sto-1" at "10.4.99.5", datalen 10385

(Note about the logs: these SNMP agent errors are expected because the device was being rebooted.)

No Zabbix agents or proxies were involved in this case.

Other information: at the same time (05:07:21) there were four other events triggered (due to a device being rebooted, causing events on that device and on nearby devices).
One of them also resulted in this "OK+OK" at the same second:

"Severity","Time","Recovery time","Status","Host","Problem","Duration"
"Average","2021-05-19 05:07:21","2021-05-19 05:12:21","RESOLVED","site1-corertr-2","BGP peer site2-corertr-1 VPN_C (10.1.29.6) has lost more than 20% of prefixes","5m"
"Average","2021-05-19 05:07:21","2021-05-19 05:07:21","RESOLVED","site1-corertr-2","BGP session with peer site2-corertr-1 (Inet) (10.1.22.18) has been restarted","0"
"High","2021-05-19 05:07:21","2021-05-19 05:12:19","RESOLVED","site1-corertr-2","BGP peer site2-inetrtr-1 (10.1.22.2) is DOWN","4m 58s"
"High","2021-05-19 05:07:21","2021-05-19 05:12:19","RESOLVED","site1-corertr-2","BGP peer site2-corertr-1 (Inet) (10.1.22.18) is DOWN","4m 58s"
"High","2021-05-19 05:07:21","2021-05-19 05:07:21","RESOLVED","site1-corertr-2","No prefixes received for peer site2-corertr-1 (Inet) (10.1.22.18)","0"

(= two of these five were "OK+OK" cases (the ones with a duration of 0), and three were normal "PROBLEM+OK" cases)

My notes/conclusions: I don't see anything strange in the server logs or in the environment behaviour in general. Later the same morning we got several other "OK+OK" cases as well, all from SNMP items (just like the ones I described when opening the ticket). The ping case we checked earlier was exceptional: I could not find any other "OK+OK" cases for icmpping items. We get these SNMP-related "OK+OK" cases a few times each month, basically whenever these BGP SNMP monitoring items cause events. They have been observed with Zabbix server versions 4.0.15, 4.4.4 (and later 4.4.x) and 5.0.x (currently 5.0.10). If I missed something specific, let me know.
Comment by Markku Leiniö [ 2021 May 19 ]

The current server is Debian Linux 10 (Buster), with almost the latest updates (= security updates installed automatically, other updates installed roughly monthly).

ii  zabbix-agent  1:5.0.10-1+buster  amd64  Zabbix network monitoring solution - agent
Comment by Markku Leiniö [ 2021 May 29 ]

To me it looks like the program logic has a problem in cases where one item of the problem expression gets a new value (which leads to the problem event being raised) AND another item in the same expression gets a new value right after the first one, still within the same second (which leads to the event being cleared immediately). Maybe the problem event is still queued somewhere in the application logic (waiting for the action to be executed) while the OK event has already been announced, and when the action executer picks up the original problem event it sets the TRIGGER.STATUS macro value according to the current state of the event, not the state that the event was queued with. Just guessing here. I have heard that triggers are evaluated immediately whenever even one item of the expression has a new value, so if the other item receives its new value just a split-second later, the problem is really short in duration.
Comment by Markku Leiniö [ 2021 May 30 ]
Yes, this is repeatable, even with a new Zabbix 5.4 installation (from repo.zabbix.com on Debian 10):
= The data is valid: first 0 (triggering the event) and then 1 (they can also be in the same second), but TRIGGER.STATUS shows an incorrect value ("OK") in the problem email. Using a shell script I could see that having a "sleep 2" between sending the 0 and 1 values still causes this OK+OK to happen, while "sleep 3" (a three-second delay) was required to get PROBLEM+OK. So this definitely looks like an action executer problem in the Zabbix server: the TRIGGER.STATUS value is assigned late, during the action execution, and not based on the actual event that is being handled. As a workaround, I see we need to change our message templates: instead of using TRIGGER.STATUS in the messages, we need to hard-code the text ("PROBLEM" or "OK") in the problem and recovery templates. Let me know if you still need more information about this problem.
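The shell-script reproduction described above might look roughly like the sketch below. This is only an illustration of the technique, not the reporter's actual script: the server address, host name, and trapper item key are placeholders, and it assumes a Zabbix trapper item whose trigger fires on value 0 and resolves on value 1.

```shell
#!/bin/sh
# Hedged sketch of the OK+OK reproduction. All names are placeholders:
# adjust ZABBIX_SERVER, HOST and KEY for your own environment.
ZABBIX_SERVER="127.0.0.1"
HOST="testhost"     # a host configured with a Zabbix trapper item
KEY="test.trap"     # trapper item key; assumed trigger: last() = 0 fires

# DRY_RUN=1 (the default here) only prints the commands instead of
# executing them, so the sketch can be read without a running server.
DRY_RUN="${DRY_RUN:-1}"

send() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "zabbix_sender -z $ZABBIX_SERVER -s $HOST -k $KEY -o $1"
    else
        zabbix_sender -z "$ZABBIX_SERVER" -s "$HOST" -k "$KEY" -o "$1"
    fi
}

send 0      # raises the problem event
sleep 2     # per the report, a 2-second gap still produced OK+OK
send 1      # recovers it; the problem mail's {TRIGGER.STATUS} reads "OK"
```

Per the observation above, changing `sleep 2` to `sleep 3` produced the expected PROBLEM+OK pair, which fits the theory that {TRIGGER.STATUS} is resolved at escalation time rather than at event time.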
Comment by Markku Leiniö [ 2021 May 30 ]

But I also have a question about the problem shown in the comment above (https://support.zabbix.com/browse/ZBX-17194?focusedCommentId=540636&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-540636): now that the values for the items mentioned in the trigger expression are evidently received and handled a split-second apart in the SNMP queries (and the trigger is evaluated between the values), how can we prevent these kinds of zero-second events from happening? Or is there no way to prevent zero-second problems when a trigger expression references two (or more) items that really change at the same time but Zabbix receives and handles them sequentially?
Comment by Dmitrijs Goloscapovs [ 2021 Aug 31 ]

Hello markkul, I would suggest using {EVENT.STATUS} instead of {TRIGGER.STATUS}: since it expands to the value of the event, it will prevent the OK+OK situation.
Comment by Mickael Martin [ 2021 Oct 11 ]

I have the same issue on 5.2.7.
Comment by Andris Zeila [ 2022 Jan 11 ]

The {TRIGGER.STATUS} macro returns the trigger status at the moment of escalation, not the trigger status at the event time. If a trigger fires and recovers shortly afterwards, it is quite possible that it has already recovered before the problem escalation is processed, and {TRIGGER.STATUS} will return the latest status - OK. To get the trigger status at the event time, please use the {EVENT.STATUS} macro, as dgoloscapov mentioned.
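Following that advice, a corrected action message template might look like the sketch below. This is an illustrative fragment, not the reporter's actual template; the field layout mirrors the messages quoted earlier in this ticket, and the macros shown are standard Zabbix macros (verify them against the documentation for your version).

```
Subject: {EVENT.STATUS}: {EVENT.NAME}

Trigger: {EVENT.NAME}
Trigger status: {EVENT.STATUS}    <- event-time value, unlike {TRIGGER.STATUS}
Trigger severity: {EVENT.SEVERITY}
Host IP: {HOST.IP}
Hostname: {HOST.NAME}
Event time: {EVENT.DATE} at {EVENT.TIME}
Original event ID: {EVENT.ID}
```

With {EVENT.STATUS}, the problem message always reads "PROBLEM" and the recovery message "RESOLVED", regardless of how quickly the trigger recovers after the event is created.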
Comment by Alexander Vladishev [ 2022 Jan 25 ]

Updated documentation:
Comment by Mickael Martin [ 2022 Jan 25 ]

A simple solution... Thank you very much!