-
Problem report
-
Resolution: Unresolved
-
Blocker
-
7.0.6, 7.2.1
-
None
-
RHEL 9.5
-
S24-W50/51/52/1, S25-W2/3
-
2
Steps to reproduce:
- Monitor with SNMP template some big device like Extreme Networks
- Use old SNMP approach, with discovery[] rules configured update interval 1h
- Use Timeout 30 (Server) and 29 (Proxy)
- Start monitor device through Proxy, get first connection errors
- See unreachable pollers go high, no metrics at all can be gathered from the device, since devices is flooded with discovery[] requests.
- See tcpdump traffic and logs - Zabbix starts overloading device with discovery[] walks every 1 minute. See the logs, discovery OID's every 1 minute are listed.
- Disabling discovery rules offload unreachable pollers immediately + device starts answering with common items.
Result:
1714:20250102:131040.025 SNMP agent item ".1.3.6.1.4.1.2272.1.101.1.1.2.1.3.[1]" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1737:20250102:131046.043 SNMP agent item "1.3.6.1.4.1.2272.1.4.8.1.1.2.[1]" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1735:20250102:131047.080 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1777:20250102:131102.645 resuming SNMP agent checks on host "KNOWHERE": connection restored 1773:20250102:131126.792 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1775:20250102:131141.652 resuming SNMP agent checks on host "KNOWHERE": connection restored 1741:20250102:131205.705 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1763:20250102:131238.697 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1711:20250102:131240.722 resuming SNMP agent checks on host "KNOWHERE": connection restored 1797:20250102:131316.077 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1748:20250102:131343.402 SNMP agent item ".1.3.6.1.4.1.2272.1.101.1.1.2.1.3.[5]" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1719:20250102:131349.421 SNMP agent item ".1.3.6.1.4.1.2272.1.101.1.1.2.1.3.[9]" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1764:20250102:131404.735 resuming SNMP agent checks on host "KNOWHERE": connection restored 1730:20250102:131428.539 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1749:20250102:131443.780 resuming SNMP agent checks on host "KNOWHERE": connection restored 1780:20250102:131507.888 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1794:20250102:131540.125 SNMP agent item ".1.3.6.1.4.1.2272.1.101.1.1.2.1.3.[9]" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1760:20250102:131555.637 resuming SNMP agent checks on host "KNOWHERE": connection restored 1740:20250102:131619.612 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1800:20250102:131634.632 resuming SNMP agent checks on host "KNOWHERE": connection restored 1755:20250102:131658.803 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1756:20250102:131731.782 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1713:20250102:131746.636 resuming SNMP agent checks on host "KNOWHERE": connection restored 1748:20250102:131810.314 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1747:20250102:131840.284 SNMP agent item "1.3.6.1.4.1.2272.1.4.8.1.1.2.[1]" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1710:20250102:131843.309 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1786:20250102:131844.853 resuming SNMP agent checks on host "KNOWHERE": connection restored 1768:20250102:131916.676 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds 1765:20250102:131943.156 SNMP agent item "1.3.6.1.4.1.2272.1.4.8.1.1.2.[1]" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1713:20250102:131949.724 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: another network error, wait for 15 seconds 1753:20250102:132004.725 resuming SNMP agent checks on host "KNOWHERE": connection restored 1722:20250102:132028.392 SNMP agent item "1.3.6.1.4.1.2272.1.4.8" on host "KNOWHERE" failed: first network error, wait for 15 seconds
Expected:
Discovery polling follows update interval and target device is not overloaded with endless requests every 1 minute if even is data relieving is flapping due to the device slowness or timeouts.
Such behaviour doesn't observed with LTS 6.0.37, which was used before an upgrade. Can be related ZBX-22864