Type: Problem report
Resolution: Fixed
Priority: Major
Fix Version/s: 5.0.7rc1, 5.2.3rc1, 5.4.0alpha1, 5.4 (plan)
Affects Version/s: 5.0.2, 5.0.3
Component/s: Proxy (P), Server (S)
Labels: None
Environment:
Zabbix 5.0.2 and 5.0.3 (tested with both)
Server, Proxy, Nginx, Frontend all inside Docker
Passive agent on macOS.

Sprint: Sprint 71 (Dec 2020)
Story Points: 1

In a somewhat larger Zabbix environment, we found that after upgrading from 4.x to 5.0.2, a large number of nodata()-based triggers spawned new problems right after the otherwise successful upgrade. After a minor upgrade from 5.0.2 to 5.0.3 the same thing happened again, causing several hundred false-positive alarms.

A deeper investigation in an isolated environment reproduced the behavior: triggers based on the nodata() function that are in the problem state, on hosts monitored via a Zabbix proxy (only tested with an ACTIVE proxy), RESOLVE after the Zabbix server is stopped and started. Apparently, this only happens if the gap between stopping and starting the server is longer than roughly 1 minute. The resolved triggers then fire again once their nodata(xx) period has elapsed, generating new problems, new alerts, etc.

Steps to reproduce:

Preparation:

Spawned a Zabbix server, an active proxy, and one (passive-only) agent.
Added a trigger "Agent nodata 5m" with the expression "{Template Module Zabbix agent:agent.ping.nodata(5m)}=1" to the existing template "Template Module Zabbix agent" (see attachment "exported_template.xml").
Added the active Zabbix proxy to the configuration and made sure it fetched the configuration.
Added two hosts, macbook-direct and macbook-via-proxy, both using the template "Template Module Zabbix agent" (see attachment "exported_hosts.xml").
Waited until both hosts became "available".
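For anyone who prefers to script the preparation instead of using the frontend, the trigger from the second step could be created through the Zabbix JSON-RPC API. The following is a minimal sketch that only builds the request body for `trigger.create`. The token value and the function name `build_trigger_create_request` are placeholders introduced here for illustration; the trigger name and expression are the ones given above.

```python
import json


def build_trigger_create_request(auth_token: str, request_id: int = 1) -> dict:
    """Build (but do not send) a JSON-RPC body for the trigger.create method.

    auth_token is a placeholder: in 5.0 it would come from a prior user.login call.
    """
    return {
        "jsonrpc": "2.0",
        "method": "trigger.create",
        "params": {
            # Name and expression exactly as used in this report.
            "description": "Agent nodata 5m",
            "expression": "{Template Module Zabbix agent:agent.ping.nodata(5m)}=1",
        },
        "auth": auth_token,
        "id": request_id,
    }


if __name__ == "__main__":
    # "<api-token>" is a stand-in, not a real credential.
    print(json.dumps(build_trigger_create_request("<api-token>"), indent=2))
```

The body would be POSTed to the frontend's api_jsonrpc.php endpoint; the attached "exported_template.xml" remains the authoritative definition of the trigger used in the test.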

Result:

With the above preparation in place, the following sequence was observed:

15:00 - both hosts "ZBX available", no problems present
15:10 - stopped the Zabbix agent
15:15 - nodata triggers (and the pre-existing Zabbix agent availability triggers) spawn problems - see attachment "Screenshot01 - problems active.png"
15:38 - restarted the Zabbix server (downtime a bit over a minute)
15:39 - nodata trigger on the host monitored via proxy recovers (while the one on the host NOT monitored via proxy stays active)
15:44 - nodata trigger on the host monitored via proxy re-triggers -> new problem (Latest data clearly shows that the item behind the nodata trigger on the host monitored via proxy has received no values since 15:10) - see attachment "Screenshot02 - resolved and new problem.png"
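The timestamps in this sequence line up with the 5-minute nodata period. The short sketch below only redoes that arithmetic. It assumes, as an interpretation of the observed times and not as a confirmed description of the server internals, that the nodata period is counted again from the moment of the false recovery. The helper names `at` and `expected_fire_time` are made up for this example.

```python
from datetime import datetime, timedelta

NODATA_PERIOD = timedelta(minutes=5)  # from nodata(5m) in the trigger expression


def at(hhmm: str) -> datetime:
    """Parse an HH:MM timestamp from the timeline (the date part is irrelevant here)."""
    return datetime.strptime(hhmm, "%H:%M")


def expected_fire_time(reference: datetime, period: timedelta = NODATA_PERIOD) -> datetime:
    """Time at which a nodata trigger would fire if the period starts at `reference`."""
    return reference + period


agent_stopped = at("15:10")   # last moment values could have arrived
false_recovery = at("15:39")  # recovery seen right after the server restart

first_problem = expected_fire_time(agent_stopped)
second_problem = expected_fire_time(false_recovery)

print(first_problem.strftime("%H:%M"))   # 15:15
print(second_problem.strftime("%H:%M"))  # 15:44
```

Both computed times match the observed problem events, which is consistent with the second problem being a plain re-evaluation of nodata(5m) after the unexpected recovery.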

Expected:
The problem on the host monitored via proxy should neither recover nor re-spawn. It should simply stay active, as it was before the Zabbix server restart. Attachment "Screenshot03 - Latest Data.png" shows that the item in question received no values during the whole test time frame.
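The expectation can be phrased as a check over exported data: a nodata problem may only recover if the item received at least one value while the problem was open. The following sketch flags recoveries that violate this. The `Event` type, the function name `false_recoveries`, and the input lists are hypothetical and do not correspond to any Zabbix API object; the timestamps in the usage part are taken from the timeline above.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class Event:
    time: datetime
    kind: str  # "problem" or "recovery"


def false_recoveries(events: List[Event], value_times: List[datetime]) -> List[Event]:
    """Return recovery events for which no item value arrived while the problem was open."""
    flagged = []
    open_problem = None
    for ev in sorted(events, key=lambda e: e.time):
        if ev.kind == "problem":
            open_problem = ev
        elif ev.kind == "recovery" and open_problem is not None:
            got_value = any(open_problem.time <= t <= ev.time for t in value_times)
            if not got_value:
                flagged.append(ev)
            open_problem = None
    return flagged


def at(hhmm: str) -> datetime:
    return datetime.strptime(hhmm, "%H:%M")


# Host monitored via proxy: last value before the agent was stopped at 15:10.
proxy_events = [Event(at("15:15"), "problem"), Event(at("15:39"), "recovery"),
                Event(at("15:44"), "problem")]
print([e.time.strftime("%H:%M") for e in false_recoveries(proxy_events, [at("15:09")])])
```

Run against the directly monitored host, whose single problem event never recovers, the same check returns an empty list.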


Attachments:
exported_hosts.xml - 2 kB - 2020 Sep 25 18:02 - Christian Anton
exported_template.xml - 6 kB - 2020 Sep 25 18:02 - Christian Anton
Screenshot01 - problems active.png - 439 kB - 2020 Sep 25 18:02 - Christian Anton
Screenshot02 - resolved and new problem.png - 410 kB - 2020 Sep 25 18:02 - Christian Anton
Screenshot03 - Latest Data.png - 292 kB - 2020 Sep 25 18:02 - Christian Anton
ZBX-18418-50.patch - 1 kB - 2020 Dec 09 18:58 - Michael Veksler

Caused by: ZBXNEXT-1891 Implicit trigger dependency when monitored via proxy (Closed)

Assignee: Michael Veksler
Reporter: Christian Anton
Team: Team C
Votes: 8
Watchers: 15
Created: 2020 Sep 25 18:03
Updated: 2024 Apr 10 17:18
Resolved: 2020 Dec 14 10:20
