[ZBX-3346] Incorrect delay betwen few checks an SNMP item Created: 2010 Dec 28  Updated: 2017 May 30  Resolved: 2011 Feb 16

Status: Closed
Project: ZABBIX BUGS AND ISSUES
Component/s: Proxy (P), Server (S)
Affects Version/s: 1.8.4rc3
Fix Version/s: 1.8.5

Type: Incident report Priority: Critical
Reporter: Alexey Pustovalov Assignee: Unassigned
Resolution: Fixed Votes: 1
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

Gentoo 34-r6, Oracle 10g2, amd64


Attachments: PNG File Screen shot 2011-01-12 at 9.27.29 AM.png     PNG File chart2.png     PNG File chart2.png     PNG File item_settings.png     PNG File values.png     Text File zabbix-1.8.4-delay.patch    
Issue Links:
Duplicate
is duplicated by ZBX-3539 Item receve values that are not suppo... Closed
is duplicated by ZBX-3448 SNMP Delta or SimpleChange have Get A... Closed

 Description   

SNMP check. Update interval : 5 min (same for 1 min and for 90 seconds)

1 case - the counter has no time to grow. Obtain a failure where it was not.
27.???.2010 15:55:09 148670656
27.???.2010 15:50:15 152791576
27.???.2010 15:45:09 153212280
27.???.2010 15:40:17 0
27.???.2010 15:40:16 158109472
27.???.2010 15:35:09 156864880
27.???.2010 15:30:09 154335408
27.???.2010 15:25:10 155345144

2 cases - the counter has time to grow. Even more frustrating because breaks down the scale of the map.
27.???.2010 13:00:09 147378488
27.???.2010 12:55:11 146663408
27.???.2010 12:50:11 120672416
27.???.2010 12:45:12 1214767736
27.???.2010 12:45:11 117281512
27.???.2010 12:40:09 118080888
27.???.2010 12:35:09 124867040
27.???.2010 12:30:09 131032560

Zabbix Server v1.8.4rc2 (revision

{ZABBIX_REVISION}

) (1 November 2010)
Compilation time: Nov 11 2010 01:23:45



 Comments   
Comment by Alexey Pustovalov [ 2011 Jan 06 ]

troubles only on high load zabbix server:
Количество узлов сети (под наблюдением/без наблюдения/шаблоны) 1117 842 / 25 / 250
Количество элементов данных (активных/деактивированых/не поддерживаются) 51564 49212 / 2114 / 238
Количество триггеров (активированных/деактивированных)[проблема/неизвестно/ок] 14725 14567 / 158 [233 / 1203 / 13130]
Количество пользователей (подключенных в данный момент) 91 10
Требуемое быстродействие сервера, новые значения в секунду 218.7 -

with config:
cat /etc/zabbix/zabbix_server.conf | egrep ^Start
StartPollers=130
StartIPMIPollers=0
StartPollersUnreachable=40
StartTrappers=120
StartPingers=60
StartDiscoverers=6
StartHTTPPollers=4
StartDBSyncers=30

Comment by Alexey Pustovalov [ 2011 Jan 06 ]

Server have 16GB RAM, RAID 1, and 2 Intel(R) Xeon(R) CPU E5420

Comment by Jon Henry [ 2011 Jan 11 ]

We are seeing this issue on some SNMPv2 checks. CentOS 5.5 64-bit, MySQL 5.1 64-bit. Zabbix 1.8.4 final. We have been using Zabbix for a long time and this seems new in 1.8.4.

The proxy is not very heavily loaded but the main server is (we have six proxies total). 3427 total hosts, 150969 total items.

The logs are exactly like Pustovalov posted. Sometimes the value is 0 and the poll interval is totally wrong. Other times it is double the correct value.

Please let me know if you need any other information to help troubleshoot, this is a major issue for us.

Comment by Jon Henry [ 2011 Jan 11 ]

Example: Poll interval is 180 seconds (3 minutes)

2011-01-10 14:05:42 1294693542 90382920
2011-01-10 14:02:46 1294693366 0
2011-01-10 13:59:41 1294693181 89447248
2011-01-10 13:53:29 1294692809 90382920
2011-01-10 13:47:29 1294692449 100636448

It seems to be missing every other poll, and the one time it DOES poll after 3 minutes, it just returns 0.

Comment by Igor Danoshaites (Inactive) [ 2011 Jan 12 ]

Henry, but as as I see from your latest data your configured Update interval for this item is 6 min, but not 3 min (13:47:29 - 13:53:29).

Comment by Alexei Vladishev [ 2011 Jan 12 ]

Jon, is there any chance to get a screenshot of the item configuration screen in order to see its configuration? It looks very much like you are calculating delta for a non-growing counter, so Zabbix just skips negative numbers (deltas).

Comment by Brian Talley [ 2011 Jan 12 ]

Item configuration showing 180 sec interval

Comment by Jon Henry [ 2011 Jan 12 ]

That last screenshot was mine. I was logged in under a co-worker's account by mistake. As you can see it is configured for a 3 minute poll cycle.

We are seeing another problem with polling on this template, too. Sometimes the values are roughly double what they should be, and sometimes it polls twice within a few seconds. Look at the lines marked with ***** below

2011-01-10 11:05:59 1294682759 68657176 *****
2011-01-10 11:05:54 1294682754 86229008 *****
2011-01-10 10:59:31 1294682371 96623168
2011-01-10 10:53:30 1294682010 96689960
2011-01-10 10:47:29 1294681649 99154472
2011-01-10 10:41:36 1294681296 106047664
2011-01-10 10:38:47 1294681127 231917632 ***** (this shows as a HUGE spike in the graph, to an impossible value for the hardware)
2011-01-10 10:35:29 1294680929 99443488
2011-01-10 10:29:31 1294680571 105943472
2011-01-10 10:23:29 1294680209 90671448
2011-01-10 10:17:30 1294679850 88653544
2011-01-10 10:11:53 1294679513 98244104

Comment by Alexey Pustovalov [ 2011 Jan 12 ]

last missed interval for SNMP item. I add screenshots in bug for thise.

Comment by Alexey Pustovalov [ 2011 Jan 12 ]

today i discover what troubles with snmp hosts via zabbix proxy too.
smallest proxy have 45 values per second, if needed we can try enable debug logging for thise proxy. Nedeed?

Comment by Alexander Vladishev [ 2011 Jan 17 ]

Fixed in development branch svn://svn.zabbix.com/branches/dev/ZBX-3346

Comment by Alexey Pustovalov [ 2011 Jan 17 ]

not fixed for me with branch:
17.Янв.2011 21:42:09 1023470720
17.Янв.2011 21:41:09 1141374480
17.Янв.2011 21:40:24 0
17.Янв.2011 21:40:23 966650264
17.Янв.2011 21:39:10 974736736

17.Янв.2011 21:21:09 1044514408
17.Янв.2011 21:20:40 1111817480
17.Янв.2011 21:20:31 1097529192
17.Янв.2011 21:19:27 0
17.Янв.2011 21:19:22 978083784

17.Янв.2011 21:05:10 1195238584
17.Янв.2011 21:04:28 0
17.Янв.2011 21:04:23 1048879800
17.Янв.2011 21:03:15 923030568

Comment by Alexander Vladishev [ 2011 Jan 18 ]

Thank you, Alexey.
Please, re-test this issue. Now should work correctly.

Comment by richlv [ 2011 Jan 18 ]

any info on what was the cause and when did the problem manifest itself ?

Comment by Aleksandrs Saveljevs [ 2011 Jan 18 ]

The problem seemed to occur when configuration cache was updating at the same time when an item was already being processed by some poller. An item got inserted into the queue, even though it should not have been.

Comment by Alexey Pustovalov [ 2011 Jan 18 ]

fixed... now items not checks twice...

Comment by Jon Henry [ 2011 Jan 18 ]

The svn://svn.zabbix.com/branches/dev/ZBX-3346 URL stopped working today. Is there another URL I can use to downloaded the fixed code? I'd like to patch our servers since this is causing big problems for us. Unless 1.8.5 is coming out very soon

Comment by Alexei Vladishev [ 2011 Jan 18 ]

The very latest 1.8.x code is always available at svn://svn.zabbix.com/branches/1.8. We remove development branches as soon as a change is merged into stable branch and/or trunk.

Comment by Jon Henry [ 2011 Jan 18 ]

I see. Thank you!

Comment by Alexander Vladishev [ 2011 Jan 18 ]

Fixed in version pre1.8.5, r16995.

Comment by as [ 2011 Feb 16 ]

I have the latest stable version 1.8.4. Where can I download a "patch" to fixe this problem, as Jon refer url "svn://svn.zabbix.com/branches/dev/ZBX-3346" is not working.

Comment by Igor Danoshaites (Inactive) [ 2011 Feb 16 ]

Hi!

This issue has been fixed in the pre-v1.8.5.
Please also find a patch file for v1.8.4 for fixing this issue in the attachment to this issue.

Comment by Igor Danoshaites (Inactive) [ 2011 Feb 16 ]

Closing resolved issue.

Comment by Aleksandrs Saveljevs [ 2011 Feb 16 ]

As Alexei mentioned earlier, development branches are removed as soon as the changes are merged into the main branches. The latest 1.8 code (with this fix included) is always available at svn://svn.zabbix.com/branches/1.8. If you wish to patch 1.8.4, you can do "svn di -c 16995 svn://svn.zabbix.com/branches/1.8" and apply the resulting patch on 1.8.4 distribution. I have not tried though.

Comment by Aleksandrs Saveljevs [ 2011 Feb 16 ]

Ignore my comment. Igor was faster.

Comment by as [ 2011 Feb 16 ]

Hi Igor/Aleksandrs

Thanks by your fast reply.

How can I apply the patch ? It is safe to apply the patch can I do a backup.

Many thanks in advance.

Comment by Aleksandrs Saveljevs [ 2011 Feb 17 ]

Please find useful instructions from Igor for applying the patch:

You should patch your zabbix_server and zabbix_proxy and then re-compile and re-install them.

1. cd zabbix-1.8.4
2. execute command zabbix-1.8.4-delay.patch|patch -p0
3. execute command "make clean"
4. execute command "make install"
5. update your zabbix_server and/or zabbix_proxy binary files on the server with new compiled binary files
6. restart zabbix_server and/or zabbix_proxy

Comment by Igor Danoshaites (Inactive) [ 2011 Feb 17 ]

>How can I apply the patch ? Is it safe to apply the patch ? Can I do a backup ?

-->1. Sorry, small mistake in my comment, in the second row the correct command should be: "cat zabbix-1.8.4-delay.patch|patch -p0"
Please note that cat should be in the beginning of this command.

2. Yes, it is absolutely safe to apply this patch.
3. Yes, if you want you can do a backup (backup of the whole zabbix directory). This patch will patch only one file in your zabbix installation: src/libs/zbxdbcache/dbconfig.c

Comment by as [ 2011 Feb 21 ]

Hi,

This really solve my problem, great support people .

Many thanks especialy to Aleksandrs Saveljevs and Igor Danoshaites.

Regards.

Comment by hamid sfandiari [ 2011 Oct 11 ]

this issue was happened again in 1.8.8 stable
with snmp Speed check of IFHCOUTOCTETS/IFHCINOCTETS oid
interval 10 sec or 15 sec

sample data :
248809160
2011.Oct.11 18:01:28 421387928
2011.Oct.11 18:01:12 218113736
2011.Oct.11 18:00:57 429393864
2011.Oct.11 18:00:42 207361472
2011.Oct.11 18:00:27 421759768
2011.Oct.11 18:00:12 205987552
2011.Oct.11 17:59:57 217666616
2011.Oct.11 17:59:43 386938184
2011.Oct.11 17:59:27 454190920
2011.Oct.11 17:59:13 200073112
2011.Oct.11 17:58:57 419605816

but the real speed is around 200-220Mbps

Generated at Sat May 17 06:52:32 EEST 2025 using Jira 9.12.4#9120004-sha1:625303b708afdb767e17cb2838290c41888e9ff0.