[ZBX-8095] Gaps in agent.ping items cause random triggers being activated Created: 2014 Apr 16  Updated: 2017 May 30  Resolved: 2014 Jun 11

Status: Closed
Project: ZABBIX BUGS AND ISSUES
Component/s: Server (S)
Affects Version/s: 2.2.3
Fix Version/s: 2.2.4rc2, 2.3.2

Type: Incident report Priority: Major
Reporter: Bart Verwilst Assignee: Unassigned
Resolution: Fixed Votes: 2
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

CentOS 6.5 x86_64


Attachments: PNG File Schermafdruk.png     PNG File agent.ping-configuration.png     Text File ipana-zabbix-server-items.txt     PNG File ipana.png     File just-fix.patch     File nextcheck-queueing-more-plus-heap-fix-and-debug.patch     File nextcheck-queueing-more-plus-heap-fix.patch     File nextcheck-queueing-more.patch     File nextcheck-queueing.patch    
Issue Links:
Duplicate
is duplicated by ZBX-8109 Frequest False alert's on Zabbix agen... Closed

 Description   

Ever since we upgraded from 2.0 to 2.2.x, we get random 'server is unreachable' triggers being activated and then resolved a minute or so later. This happens once or twice every day (and night), with no apparent cause.

Here are the agent.ping values from around the time the alert was triggered:

2014.Apr.16 03:36:58 Up (1)
2014.Apr.16 03:35:28 Up (1)
2014.Apr.16 03:33:58 Up (1)
2014.Apr.16 03:32:28 Up (1)
2014.Apr.16 03:32:16 Up (1)
2014.Apr.16 03:21:58 Up (1)
2014.Apr.16 03:20:29 Up (1)
2014.Apr.16 03:18:58 Up (1)
2014.Apr.16 03:17:28 Up (1)
2014.Apr.16 03:15:58 Up (1)

As you can see, there is a 10-minute gap in the values. We set DebugLevel to 4 on both server and agent, and the logs show that the server never requests those items during the gap, which makes it look like a server issue rather than an agent one.

A bit more info about our environment:

Number of hosts (monitored/not monitored/templates): 444 (384 / 7 / 53)
Number of items (monitored/disabled/not supported): 26142 (24525 / 513 / 1104)
Number of triggers (enabled/disabled) [problem/ok]: 5406 (5401 / 5) [51 / 5350]
Required server performance, new values per second: 144.63

Item: Agent ping (Triggers: 2), key: agent.ping, interval: 60, history: 7, trends: 7, type: Zabbix agent, application: Zabbix agent
Trigger:

{Template Zabbix Agent:agent.ping.nodata(5m)}=1

PS: Our template has an update interval of 60, while all hosts show it as 90; I guess that's worthy of another bug report.

The internal Zabbix server items are quite idle; the busy poller percentage, for example, is around 10-15%. It used to be 20-25%, but we increased the pollers from 5 to 12 in the hope of alleviating this problem. It didn't help.

Anything else we can provide?



 Comments   
Comment by richlv [ 2014 Apr 16 ]

Missing data does not by itself point to a clear bug; it could be a network issue or some misconfiguration.

Comment by Bart Verwilst [ 2014 Apr 16 ]

cat zabbix-ipana-20140410.log | awk '{ print $6 " " substr($1,6,13)}' | sort

As you can see, for the particular host that was randomly marked as down at that time, all checks are requested without a hitch except for agent.ping, which is silent for 10 minutes. For clarification, the above is taken from the server log at DebugLevel 4.

Comment by Aleksandrs Saveljevs [ 2014 Apr 17 ]

Related issue: ZBX-8109.

Comment by Nikolajs Agafonovs (Inactive) [ 2014 Apr 25 ]

Could you please attach the full Zabbix server log with DebugLevel 4 at that period (2014.Apr.16 03:15:58-03:36:58)?

Comment by Bart Verwilst [ 2014 Apr 25 ]

Logs are bzip2-compressed, roughly 30 MB, and can be found here: http://www.trimbletl.com/files/ZBX-8095-zbxsrv-level4.log.bz2

This is not about the period 2014.Apr.16 03:15:58-03:36:58, but about the one from the ipana-zabbix-server-items.txt attachment, with a gap from 20140410:1031 to 20140410:1042. (It contains the debug output from 10:20 to 10:59, IIRC.)

Comment by Nikolajs Agafonovs (Inactive) [ 2014 Apr 25 ]

Which is the related host name?

Comment by Bart Verwilst [ 2014 Apr 25 ]

centos6-install-ipana

Comment by Nikolajs Agafonovs (Inactive) [ 2014 Apr 28 ]

Could you please attach a screenshot of that item's configuration (as I understand it, it is hostid:10226, itemid:47808)? Also, does the problem happen every day at a certain time? And is this happening only with one host, or with different ones?

Comment by Bart Verwilst [ 2014 Apr 28 ]

The problem happens daily, at seemingly random times and on seemingly random hosts. Ipana just happened to be the one that triggered while my DebugLevel was at 4. It didn't occur again for that host, but every day a couple of other hosts behave exactly the same way.

Comment by Nikolajs Agafonovs (Inactive) [ 2014 Apr 28 ]

Could you actually click the "Agent ping" item to open its configuration? We would like to see a screenshot of the form where the item's "Name", "Value type", "Key", "Delay", etc. are set.

Comment by Nikolajs Agafonovs (Inactive) [ 2014 Apr 28 ]

Would it be possible to attach the log for that "Ipana" case (and tell us the exact host name)?

Comment by Bart Verwilst [ 2014 May 05 ]

The logs http://www.trimbletl.com/files/ZBX-8095-zbxsrv-level4.log.bz2 are about ipana ( hostname centos6-install-ipana ).

Comment by Bart Verwilst [ 2014 May 05 ]

agent.ping configuration window as requested.

Comment by Nikolajs Agafonovs (Inactive) [ 2014 May 07 ]

Please show a screenshot of the agent.ping item configuration specifically for the centos6-install-ipana host. You showed it for the template, which is not informative enough.

Comment by Bart Verwilst [ 2014 May 07 ]

Screenshot of ipana's host agent.ping configuration

Comment by Bart Verwilst [ 2014 May 12 ]

Anything else I can provide to you?

Comment by Nikolajs Agafonovs (Inactive) [ 2014 May 12 ]

We can't reproduce this issue. It seems that 'evaluate_function()' works correctly all the time for 'agent.ping', but 'get_value_agent()' pauses for 10 minutes, after which it is executed twice. We'll try to investigate this issue further.

grep "centos6-install-ipana:agent.ping.nodata" ZBX-8095-zbxsrv-level4.log # there is NO pause (executes every 30sec)

grep "In get_value_agent() host:'centos6-install-ipana' addr:'centos6-install-ipana.adc.ttl' key:'agent.ping'" ZBX-8095-zbxsrv-level4.log # there IS 10min pause and then 2 times within 2seconds (executes every 90sec)

Comment by Bart Verwilst [ 2014 May 12 ]

If we can assist by running our server with more debugging enabled specifically for this case, we'll be more than happy to do so, for example in the form of a patch that adds this debugging.

Comment by soichirou jyashiki [ 2014 May 20 ]

I also confirmed the similar problem.
However, what I am using is 2.0.8
I think. There is a problem with the NODATA function.
The cause of this problem ...

1.Monitoring server high load
2.timer process is busy
3.you have used the of many NODATA funcction

Element to be used in the evaluation

item.lastclock
now = (int)time(NULL);

Acquisition timing of the two is different.

I think. You can try the Tuning StartTimers

I'm sorry if my English sentences are incorrect
i can't speak english...
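
For illustration, here is a minimal sketch of the timing mismatch described above. This is not Zabbix source; nodata_fires() is a hypothetical stand-in that compares item.lastclock against a "now" taken at evaluation time, showing how a delayed evaluation can cross the nodata() threshold even though values arrive on schedule.

#include <stdio.h>
#include <time.h>

/* hypothetical, simplified stand-in for a nodata(period) check:
 * returns 1 ("no data") if no value arrived within 'period' seconds of 'now' */
static int nodata_fires(int lastclock, int now, int period)
{
    return (now - lastclock > period) ? 1 : 0;
}

int main(void)
{
    int now       = (int)time(NULL);
    int lastclock = now - 295;          /* last value stored 295 seconds ago */

    /* evaluated on time: 295 <= 300, no problem is reported */
    printf("on time:   %d\n", nodata_fires(lastclock, now, 300));

    /* evaluated 10 seconds late by a busy timer process: 305 > 300, false alarm */
    printf("10 s late: %d\n", nodata_fires(lastclock, now + 10, 300));

    return 0;
}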

Comment by Aleksandrs Saveljevs [ 2014 May 22 ]

Bart, could you please execute the following query on your database?

select * from items where itemid=47808\G

It seems a bit suspicious to me that the item is updated every 90 seconds, whereas the screenshot you posted shows 60 seconds. You might have changed the interval after the log fragment though.

My current guess, provided the database is consistent, is that the problem might be with "nextcheck" calculation and item queueing. For this reason I have prepared a patch for you against 2.2.3 that adds more debugging information for DebugLevel=4 (see nextcheck-queueing.patch). Could you please run Zabbix server for a while with this patch until the next time the issue occurs and provide a log file in the same wonderful way that you already did?

Also, are there any other custom patches that you are running?

Comment by Bart Verwilst [ 2014 May 23 ]

mysql> select * from items where itemid=47808\G
itemid: 47808
type: 0
snmp_community:
snmp_oid:
hostid: 10226
name: Agent ping
key_: agent.ping
delay: 60
history: 7
trends: 7
status: 0
value_type: 3
trapper_hosts:
units:
multiplier: 0
delta: 0
snmpv3_securityname:
snmpv3_securitylevel: 0
snmpv3_authpassphrase:
snmpv3_privpassphrase:
formula: 1
error:
lastlogsize: 0
logtimefmt:
templateid: 10020
valuemapid: 10
delay_flex:
params:
ipmi_sensor:
data_type: 0
authtype: 0
username:
password:
publickey:
privatekey:
mtime: 0
flags: 0
filter:
interfaceid: 142
port:
description: The agent always returns 1 for this item. It could be used in combination with nodata() for availability check.
inventory_link: 0
lifetime: 0
snmpv3_authprotocol: 0
snmpv3_privprotocol: 0
state: 0
snmpv3_contextname:
1 row in set (0.00 sec)

Yeah, we got the interval fixed in the meantime, so the 60 vs 90 seconds discrepancy can be disregarded.

Comment by Bart Verwilst [ 2014 May 23 ]

Patched version is currently running with debuglevel 4.

11:00:42 centos6-zabbix-zalador | zabbix | /home/verwilst # tail -f /var/log/zabbix/zabbix_server.log | grep update_item_queue
26662:20140523:110044.282 DCupdate_item_queue() itemid:71984 location:0 poller_type:0 nextcheck:'1400835704,2014.05.23 11:01:44' old_nextcheck:'1400835704,2014.05.23 11:01:44'
26659:20140523:110045.242 DCupdate_item_queue() itemid:56385 location:0 poller_type:0 nextcheck:'1400835705,2014.05.23 11:01:45' old_nextcheck:'1400835705,2014.05.23 11:01:45'
26662:20140523:110045.308 DCupdate_item_queue() itemid:61545 location:0 poller_type:0 nextcheck:'1400835705,2014.05.23 11:01:45' old_nextcheck:'1400835705,2014.05.23 11:01:45'
26664:20140523:110045.349 DCupdate_item_queue() itemid:56145 location:0 poller_type:0 nextcheck:'1400835705,2014.05.23 11:01:45' old_nextcheck:'1400835705,2014.05.23 11:01:45'
26660:20140523:110045.380 DCupdate_item_queue() itemid:71145 location:0 poller_type:0 nextcheck:'1400835705,2014.05.23 11:01:45' old_nextcheck:'1400835705,2014.05.23 11:01:45'
<snip>

This is what you added to the server with your patch, is it not? Which proves that it is working. We should have new data very soon, after the weekend at the latest.

Thanks for the patch!

Comment by Bart Verwilst [ 2014 May 23 ]

And finally, we did not patch zabbix_server in any other way.

Comment by Aleksandrs Saveljevs [ 2014 May 23 ]

Thank you, Bart! Shall be waiting for the log file after the weekend.

Comment by Bart Verwilst [ 2014 May 27 ]

As promised: http://www.trimbletl.com/files/zabbix_server_20140527_0800-0830.log.gz

Event that was triggered (recovered 5 minutes later):

Average: Host centos6-clouderamanager-caroni is unreachable
Time: 2014.05.27 08:17:30
Trigger: 109486
Event: 1396908

Comment by Aleksandrs Saveljevs [ 2014 May 28 ]

If we do a grep on that item ID, we will notice something strange:

$ grep itemid:83596 zabbix_server_20140527_0800-0830.log | grep -v -e query -e zbx_vc_
...
7991:20140527:081213.517 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171136,2014.05.27 08:12:16' old_nextcheck:'1401171136,2014.05.27 08:12:16'
7998:20140527:081216.441 In activate_host() hostid:10584 itemid:83596 type:0
7998:20140527:081216.442 DCupdate_item_queue() itemid:83596 location:0 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:081314.015 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:081414.645 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:081515.166 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:081615.685 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:081716.173 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:081816.700 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:081917.191 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:082017.803 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7991:20140527:082118.303 DCupdate_item_queue() itemid:83596 location:1 poller_type:0 nextcheck:'1401171196,2014.05.27 08:13:16' old_nextcheck:'1401171196,2014.05.27 08:13:16'
7998:20140527:082216.058 In activate_host() hostid:10584 itemid:83596 type:0
7998:20140527:082216.058 DCupdate_item_queue() itemid:83596 location:0 poller_type:0 nextcheck:'1401171796,2014.05.27 08:23:16' old_nextcheck:'1401171796,2014.05.27 08:23:16'
...

The strange thing is that the configuration syncer (PID 7991) keeps updating the item with a nextcheck of 08:13:16, even after that time has long passed, until the item finally gets checked, for an unknown reason, at the correct time (08:22:16), despite having a weird and outdated nextcheck.

This leads me to believe that there are some queueing problems going on, so I have added more debugging information to the patch (see nextcheck-queueing-more.patch) that might help to investigate this further. Bart, could you please try running the server with the new patch?

Comment by Bart Verwilst [ 2014 Jun 02 ]

Patched binary installed, stay tuned for some new logs.

Comment by Bart Verwilst [ 2014 Jun 02 ]

The new log for your enjoyment: http://www.trimbletl.com/files/zabbix_server_20140602_1030-1055.log.gz

The event that was triggered (and OK 5 minutes later):

Average: Host centos6-mail-mack is unreachable
Time: 2014.06.02 10:47:00
Trigger: 109395
Event: 1436455

Comment by Aleksandrs Saveljevs [ 2014 Jun 02 ]

We have found one bug in the binary heap implementation. Namely, in the zbx_binary_heap_remove_direct() function, we should not only try bubbling the replacement element down, but also try bubbling it up (see the sketch below).

Bart, I am still in the process of investigating the issue, but could you please try patching your server with nextcheck-queueing-more-plus-heap-fix.patch (this includes both the old debugging information and the heap fix) and see whether that makes it any better?
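
For illustration, here is a minimal generic min-heap sketch of the fix being described. This is not the actual zbx_binary_heap code; it only shows why, after the removed slot is filled with the heap's last element, the replacement must be sifted up as well as down (at most one of the two calls actually moves it).

#include <stdio.h>
#include <stddef.h>

static void swap_ints(int *a, int *b) { int t = *a; *a = *b; *b = t; }

/* move the element at index i towards the root while it is smaller than its parent */
static void sift_up(int *h, size_t i)
{
    while (0 < i && h[(i - 1) / 2] > h[i])
    {
        swap_ints(&h[(i - 1) / 2], &h[i]);
        i = (i - 1) / 2;
    }
}

/* move the element at index i towards the leaves while it is larger than a child */
static void sift_down(int *h, size_t n, size_t i)
{
    for (;;)
    {
        size_t l = 2 * i + 1, r = 2 * i + 2, m = i;

        if (l < n && h[l] < h[m]) m = l;
        if (r < n && h[r] < h[m]) m = r;
        if (m == i) break;

        swap_ints(&h[m], &h[i]);
        i = m;
    }
}

/* remove the element at index i from a heap of n elements; returns the new size */
static size_t heap_remove_direct(int *h, size_t n, size_t i)
{
    h[i] = h[--n];      /* move the last element into the freed slot */
    sift_down(h, n, i); /* it may be too large for this subtree ... */
    sift_up(h, i);      /* ... or too small for its new ancestors (the missing case) */
    return n;
}

int main(void)
{
    /* a valid min-heap; removing index 3 (value 21) puts the last element (4)
     * in its place, and 4 must bubble UP past its new parent 20 to keep the
     * heap valid -- the case a down-only remove misses */
    int h[] = { 1, 20, 2, 21, 22, 3, 4 };
    size_t n = heap_remove_direct(h, sizeof(h) / sizeof(h[0]), 3);

    for (size_t i = 0; i < n; i++)
        printf("%d ", h[i]);
    printf("\n");

    return 0;
}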

Comment by Bart Verwilst [ 2014 Jun 03 ]

Patch has been installed. Will keep you posted!

Comment by Bart Verwilst [ 2014 Jun 03 ]

Well, we just had another false positive:

Average: Host centos6-ldap-ledgic is unreachable
Time: 2014.06.03 13:07:00
Trigger: 109378
Event: 1442183

Corresponding logs: http://www.trimbletl.com/files/zabbix_server_20140603_1300-1314.log.gz

Comment by Aleksandrs Saveljevs [ 2014 Jun 03 ]

Thank you! But the bug with the binary heap has to be fixed anyway.

Comment by Bart Verwilst [ 2014 Jun 03 ]

Sure, we can call it the first positive side-effect of this issue. Throw any other patches my way, I'm standing by!

Comment by Aleksandrs Saveljevs [ 2014 Jun 05 ]

Just noticed a pattern in the logs:

$ grep 'get.value.agent.*centos6-clouderamanager-caroni.*agent.ping' zabbix_server_20140527_0800-0830.log
...
8000:20140527:081116.028 In get_value_agent() host:'centos6-clouderamanager-caroni' addr:'centos6-clouderamanager-caroni.adc.ttl' key:'agent.ping'
7998:20140527:081216.440 In get_value_agent() host:'centos6-clouderamanager-caroni' addr:'centos6-clouderamanager-caroni.adc.ttl' key:'agent.ping'
7998:20140527:082216.056 In get_value_agent() host:'centos6-clouderamanager-caroni' addr:'centos6-clouderamanager-caroni.adc.ttl' key:'agent.ping'
8002:20140527:082316.266 In get_value_agent() host:'centos6-clouderamanager-caroni' addr:'centos6-clouderamanager-caroni.adc.ttl' key:'agent.ping'
...
$ grep 'get.value.agent.*centos6-mail-mack.*agent.ping' zabbix_server_20140602_1030-1055.log
...
22438:20140602:104046.170 In get_value_agent() host:'centos6-mail-mack' addr:'centos6-mail-mack.adc.ttl' key:'agent.ping'
22439:20140602:104146.976 In get_value_agent() host:'centos6-mail-mack' addr:'centos6-mail-mack.adc.ttl' key:'agent.ping'
22438:20140602:105216.907 In get_value_agent() host:'centos6-mail-mack' addr:'centos6-mail-mack.adc.ttl' key:'agent.ping'
22443:20140602:105246.927 In get_value_agent() host:'centos6-mail-mack' addr:'centos6-mail-mack.adc.ttl' key:'agent.ping'
...
$ grep 'get.value.agent.*centos6-ldap-ledgic.*agent.ping' zabbix_server_20140603_1300-1314.log 
4840:20140603:130048.177 In get_value_agent() host:'centos6-ldap-ledgic' addr:'centos6-ldap-ledgic.adc.ttl' key:'agent.ping'
4834:20140603:130148.502 In get_value_agent() host:'centos6-ldap-ledgic' addr:'centos6-ldap-ledgic.adc.ttl' key:'agent.ping'
4842:20140603:131216.722 In get_value_agent() host:'centos6-ldap-ledgic' addr:'centos6-ldap-ledgic.adc.ttl' key:'agent.ping'
4844:20140603:131248.915 In get_value_agent() host:'centos6-ldap-ledgic' addr:'centos6-ldap-ledgic.adc.ttl' key:'agent.ping'
4843:20140603:131348.178 In get_value_agent() host:'centos6-ldap-ledgic' addr:'centos6-ldap-ledgic.adc.ttl' key:'agent.ping'

The item that disappears for 10 minutes always gets checked at 16 seconds past the minute.

Comment by Aleksandrs Saveljevs [ 2014 Jun 05 ]

Bart, would it be possible for you to attach the templates you have on these hosts so that I can try to reproduce the issue locally with settings as close as possible to yours?

Comment by Aleksandrs Saveljevs [ 2014 Jun 06 ]

In the last log, the following lines were interesting:

...
4834:20140603:130148.504 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:-1 old_itemid_at_index:0 old_size_of_heap:21926 old_min_nextcheck_in_heap:'1401793309,2014.06.03 13:01:49' location:0 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4834:20140603:130148.504 DCupdate_item_queue() itemid:48048 new_index_in_heap:21926 new_itemid_at_index:48048 new_size_of_heap:21927 new_min_nextcheck_in_heap:'1401793309,2014.06.03 13:01:49'
4831:20140603:130155.723 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21934 old_min_nextcheck_in_heap:'1401793316,2014.06.03 13:01:56' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:130256.528 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21934 old_min_nextcheck_in_heap:'1401793377,2014.06.03 13:02:57' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:130357.387 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21933 old_min_nextcheck_in_heap:'1401793437,2014.06.03 13:03:57' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:130458.336 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21932 old_min_nextcheck_in_heap:'1401793499,2014.06.03 13:04:59' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:130559.748 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21934 old_min_nextcheck_in_heap:'1401793560,2014.06.03 13:06:00' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:130700.487 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21934 old_min_nextcheck_in_heap:'1401793620,2014.06.03 13:07:00' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:130801.059 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21934 old_min_nextcheck_in_heap:'1401793681,2014.06.03 13:08:01' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:130901.667 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21933 old_min_nextcheck_in_heap:'1401793742,2014.06.03 13:09:02' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:131002.400 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21930 old_min_nextcheck_in_heap:'1401793803,2014.06.03 13:10:03' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:131103.045 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:17433 old_itemid_at_index:48048 old_size_of_heap:21934 old_min_nextcheck_in_heap:'1401793863,2014.06.03 13:11:03' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4831:20140603:131203.620 DCupdate_item_queue() itemid:48048 state:0 old_index_in_heap:2178 old_itemid_at_index:48048 old_size_of_heap:21934 old_min_nextcheck_in_heap:'1401793924,2014.06.03 13:12:04' location:1 poller_type:0 nextcheck:'1401793368,2014.06.03 13:02:48' old_poller_type:0 old_nextcheck:'1401793368,2014.06.03 13:02:48'
4842:20140603:131216.722 DCconfig_get_poller_items() itemid:48048 nextcheck:'1401793368,2014.06.03 13:02:48'
4842:20140603:131216.722 DCconfig_get_poller_items() itemid:48048 taking item for processing
...

We can see that for 10 minutes the item stays in the heap at the same place without moving.

This motivated me to add more debugging information to the heap to try to understand why.

Bart, could you please try the new patch nextcheck-queueing-more-plus-heap-fix-and-debug.patch? It includes all previous patches.

Comment by Bart Verwilst [ 2014 Jun 10 ]

Patch installed.

Comment by Bart Verwilst [ 2014 Jun 10 ]

http://www.trimbletl.com/files/zabbix_server_20140610_1200-1230.log.bz2

Average: Host centos6-testbench-tailor is unreachable
Time: 2014.06.10 12:07:00
Trigger: 109383
Event: 1483553

and

Average: Host centos6-radius-roshi is unreachable
Time: 2014.06.10 12:17:30
Trigger: 109408
Event: 1483609

Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

Bart, could you please execute the following query in the meantime: "select * from items where itemid=131536\G"?

Comment by Bart Verwilst [ 2014 Jun 11 ]

mysql> select * from items where itemid=131536\G
*************************** 1. row ***************************
               itemid: 131536
                 type: 0
       snmp_community: 
             snmp_oid: 
               hostid: 10644
                 name: tupreturned
                 key_: postgresql[tupreturned]
                delay: 30
              history: 90
               trends: 365
               status: 0
           value_type: 0
        trapper_hosts: 
                units: 
           multiplier: 0
                delta: 0
  snmpv3_securityname: 
 snmpv3_securitylevel: 0
snmpv3_authpassphrase: 
snmpv3_privpassphrase: 
              formula: 1
                error: Type of received value [2535931851232.000000] is not suitable for value type [Numeric (float)]
          lastlogsize: 0
           logtimefmt: 
           templateid: 130436
           valuemapid: NULL
           delay_flex: 
               params: 
          ipmi_sensor: 
            data_type: 0
             authtype: 0
             username: 
             password: 
            publickey: 
           privatekey: 
                mtime: 0
                flags: 0
               filter: 
          interfaceid: 578
                 port: 
          description: 
       inventory_link: 0
             lifetime: 30
  snmpv3_authprotocol: 0
  snmpv3_privprotocol: 0
                state: 1
   snmpv3_contextname: 
1 row in set (0.00 sec)

Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

Thank you, Bart, I have found the problem. Now I just need a bit of time to formulate it.

Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

Broken in:

$ svn log -c 37120
------------------------------------------------------------------------
r37120 | pavels | 2013-07-18 15:30:02 +0300 (Thu, 18 Jul 2013) | 10 lines

A.F.....S. [ZBXNEXT-1689] implemented server-side value cache

Before the change:
- Information about last and previous item values was stored in the items tables which required frequent updates and caused table availability problems.

After the change:
- Zabbix server stores last item values in the value cache. Frontend retrieves those values directly from the history tables. If the history storage period is set to 0, item last value information will not be available in the overview, latest data pages and the {ITEM.LASTVALUE} macro will not work. Item last check dates will no longer be displayed for unsupported items. Item change value will no longer be displayed in the latest data page when an item receives the first value.
- Item lastclock, lastns, lastvalue, prevvalue and prevorgvalue columns have been dropped.
- Item API prevorgvalue property was removed, lastclock, lastns, lastvalue and prevvalue properties are still supported. Host.get with_historical_items and hostgroup.get with_historical_items parameters were removed.
- The item queue is now requested from the Zabbix server instead of being calculated on the frontend side.
------------------------------------------------------------------------
Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

Taking the last log as an example, the story goes like this.

The "agent.ping" item was checked at 12:01:48 and was put into the queue with "nextcheck" at 12:02:48. At some point it arrived at index 7750 in the queue and sat there happily. Then, at 12:02:16, the "postgresql[tupreturned]" item mentioned above was checked and requeued with "nextcheck" at 12:02:46. During requeueing it arrived at index 3874 in the queue (the parent of the "agent.ping" item). However, database syncer later found out that the received value for "postgresql[tupreturned]" was not suitable, made it not supported and tried to requeue it with "nextcheck" at 12:12:16 (this happens at the end of DCcalculate_item_delta_float() function).

At this point all is well, except that there is a bug in DCrequeue_reachable_item() and DCrequeue_unreachable_item(): both update "dc_item->nextcheck" before remembering the old value in the "old_nextcheck" variable. Hence, in DCupdate_item_queue(), the variables "dc_item->nextcheck" and "old_nextcheck" coincide, and the queue is not updated (see the sketch below). So we end up with an inconsistent ordering in the queue: "postgresql[tupreturned]" has its "nextcheck" at 12:12:16, while "agent.ping" sits behind it with a "nextcheck" of 12:02:48. As a result, "agent.ping" only gets checked after "postgresql[tupreturned]", at 12:12:16.
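
For illustration, here is a simplified sketch of that ordering bug and the corresponding fix. It is not the actual Zabbix source: the struct, the update_item_queue() stub, and the two requeue functions are stand-ins for dc_item, DCupdate_item_queue(), and the DCrequeue_*_item() functions, kept only to show why saving old_nextcheck after the assignment turns the queue update into a no-op.

#include <stdio.h>

typedef struct { int itemid; int nextcheck; } dc_item_t;

/* hypothetical stand-in for DCupdate_item_queue(): it reorders the item in
 * the queue only when its nextcheck actually changed */
static void update_item_queue(dc_item_t *item, int old_nextcheck)
{
    if (item->nextcheck == old_nextcheck)
        printf("item %d: nextcheck unchanged, queue left as is\n", item->itemid);
    else
        printf("item %d: requeued for %d\n", item->itemid, item->nextcheck);
}

/* the broken pattern: nextcheck is overwritten before the old value is saved */
static void requeue_buggy(dc_item_t *item, int new_nextcheck)
{
    item->nextcheck = new_nextcheck;        /* updated first ...            */
    int old_nextcheck = item->nextcheck;    /* ... so the old value is lost */

    update_item_queue(item, old_nextcheck); /* always sees "no change"      */
}

/* the fixed pattern: remember the old value, then update */
static void requeue_fixed(dc_item_t *item, int new_nextcheck)
{
    int old_nextcheck = item->nextcheck;

    item->nextcheck = new_nextcheck;
    update_item_queue(item, old_nextcheck); /* sees the change and re-sorts */
}

int main(void)
{
    dc_item_t item = { 131536, 1402387336 };

    requeue_buggy(&item, 1402387936);   /* the queue never learns the new time */
    requeue_fixed(&item, 1402388536);   /* the queue is re-sorted correctly    */

    return 0;
}

Capturing the old value before the assignment lets the queue update see the change and re-sort the item, so the stale ordering described above cannot persist.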

Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

The binary heap bug has been split out into ZBX-8343.

Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

Fixed in development branch svn://svn.zabbix.com/branches/dev/ZBX-8095 .

Comment by Bart Verwilst [ 2014 Jun 11 ]

Can we get a patch with the fix so we can test and verify it before 2.2.4 is out? Maybe without the debugging?
Thanks a lot for the awesome debugging skills!

Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

Bart, thank you for being so cooperative!

You can find the patch for Zabbix 2.2.3 with just the fix in "just-fix.patch".

Comment by Bart Verwilst [ 2014 Jun 11 ]

Installed, will keep you posted!

Comment by Aleksandrs Saveljevs [ 2014 Jun 11 ]

Fixed in pre-2.2.4 r46421 and pre-2.3.2 (trunk) r46422.

Comment by Bart Verwilst [ 2014 Jun 19 ]

We haven't had a single false unreachable since installing this patch! Thanks a lot!
