[ZBXNEXT-1414] Adjust Item interval based on Trigger State Created: 2012 Sep 08  Updated: 2023 Nov 02

Status: Open
Project: ZABBIX FEATURE REQUESTS
Component/s: None
Affects Version/s: 2.0.3
Fix Version/s: None

Type: New Feature Request Priority: Major
Reporter: Florian Koch Assignee: Unassigned
Resolution: Unresolved Votes: 51
Labels: interval, items, scheduling
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Duplicate
is duplicated by ZBXNEXT-1475 Recheck Interval for Items Closed
is duplicated by ZBXNEXT-1979 Check item more frequent when trigger... Closed

 Description   

Hi,

I would like to have support for the following feature in Zabbix:

Adjust the check interval of the item that causes the trigger to go into the PROBLEM state, and restore the original value when the trigger goes back into the OK state.

For example, an item checks every 300 s whether a service is running, and a trigger reports on it. If the trigger goes into the PROBLEM state, the interval is adjusted to e.g. 60 s. Then, if the service is down for only 1 minute, the trigger's OK event is sent after 1 or 2 minutes instead of 5, so the "downtime" Zabbix reports is 1 or 2 minutes, not 5.

Yes, the check interval can be set to 1 minute, but on big installations you need to set check intervals carefully.

Regards, Florian



 Comments   
Comment by Ahmad Khayyat [ 2012 Dec 27 ]

It may be useful to look at how OpenNMS implements this feature, which it calls "Downtime Models":

http://www.opennms.org/wiki/Polling_Configuration_How-To#Downtime_Models

The incentive is to report a more accurate SLA:

By default, the poller will poll every five minutes. If that polling rate was static, then the shortest an outage could be would be five minutes: one poll to note the outage and the next to note it was restored.

...

To help combat this, OpenNMS uses adaptive polling. Once an outage is detected, polling is temporarily increased to try and detect, as soon as possible, when the service is restored.

<downtime interval="30000" begin="0" end="300000"/> <!-- 30s, 0, 5m -->
<downtime interval="300000" begin="300000" end="43200000"/> <!-- 5m, 5m, 12h -->
<downtime interval="600000" begin="43200000" end="432000000"/> <!-- 10m, 12h, 5d -->
<downtime begin="432000000" delete="true"/> <!-- anything after 5 days delete -->
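
As a rough illustration of how such a downtime model maps outage age to polling interval, here is a minimal Python sketch. The numbers are copied from the `<downtime>` elements quoted above; the function name, the table layout, and the choice to treat `begin` as inclusive and `end` as exclusive are assumptions made for this example, not OpenNMS code.

```python
from typing import Optional

# (begin_ms, end_ms, interval_ms), copied from the quoted <downtime> elements.
DOWNTIME_MODEL = [
    (0, 300_000, 30_000),                # first 5 minutes: poll every 30 s
    (300_000, 43_200_000, 300_000),      # 5 minutes to 12 hours: poll every 5 min
    (43_200_000, 432_000_000, 600_000),  # 12 hours to 5 days: poll every 10 min
]
DELETE_AFTER_MS = 432_000_000            # after 5 days the service is deleted


def polling_interval_ms(outage_age_ms: int) -> Optional[int]:
    """Return the polling interval for an outage of the given age.

    Returns None once the outage is old enough that the model deletes
    the service. Boundary handling (begin inclusive, end exclusive) is
    an assumption of this sketch.
    """
    if outage_age_ms < 0:
        raise ValueError("outage age must not be negative")
    if outage_age_ms >= DELETE_AFTER_MS:
        return None
    for begin, end, interval in DOWNTIME_MODEL:
        if begin <= outage_age_ms < end:
            return interval
    return None  # not reached: the ranges above cover [0, DELETE_AFTER_MS)
```

For example, `polling_interval_ms(60_000)` falls into the first range, so an outage that is one minute old would be rechecked every 30 seconds.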

Comment by Luca [ 2015 May 11 ]

The "Downtime Models" by OpenNMS are a good example. The other option is to use the "action" feature already present in Zabbix to execute a shell action that changes the item.
You can call the API from a shell, but it is not easy to implement without variables. The flow could be: 1) trigger goes into PROBLEM; 2) execute action "call API and alter item's update interval"; 3) trigger goes into OK; 4) execute action "call API and alter item's update interval".
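
The flow above could be sketched roughly as follows, in Python rather than shell. This is only a sketch under stated assumptions: `item.update` with the `itemid` and `delay` parameters is the Zabbix API method for changing an item's update interval, but the authentication style (a Bearer token header, as in recent Zabbix versions; older versions pass an `auth` field in the request body), the string form of `delay` (older versions expect an integer number of seconds), the two interval values, and all function names are assumptions for illustration. The trigger state is assumed to be passed in from the action via the `{TRIGGER.VALUE}` macro (`1` for PROBLEM, `0` for OK).

```python
import json
import urllib.request


def delay_for_trigger_state(trigger_value: str,
                            normal: str = "300s",
                            problem: str = "60s") -> str:
    """Pick the update interval: short while in PROBLEM, normal otherwise."""
    return problem if trigger_value == "1" else normal


def build_item_update(item_id: str, delay: str, request_id: int = 1) -> dict:
    """Build the JSON-RPC request body for an item.update call."""
    return {
        "jsonrpc": "2.0",
        "method": "item.update",
        "params": {"itemid": item_id, "delay": delay},
        "id": request_id,
    }


def send(api_url: str, token: str, payload: dict) -> dict:
    """POST the request to the API endpoint and return the decoded reply.

    api_url is assumed to point at the frontend's api_jsonrpc.php;
    the Bearer header is an assumption that fits newer Zabbix versions.
    """
    request = urllib.request.Request(
        api_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json-rpc",
            "Authorization": f"Bearer {token}",
        },
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)
```

An action script would then call something like `send(url, token, build_item_update(item_id, delay_for_trigger_state(trigger_value)))` on both the PROBLEM and the OK event, so the same script covers steps 2 and 4.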

Comment by Corey Shaw [ 2015 May 11 ]

One major issue with using the API to alter the item's update interval is the caching that goes on. For instance, the configuration syncer runs every 1 minute by default on a Zabbix server. If I modify the item interval using the method mentioned by Luca, I will have to wait up to one minute before that change goes into effect. This doesn't sound so bad in a very basic default install, but it becomes much worse as you add in proxies and refresh intervals for active items.

For instance, in my Zabbix environment my server updates its config cache every 10 minutes, my proxies update their cache every 10 minutes, and my agents update their active checks every 10 minutes. So in the situation given above, I would have to wait up to 10+10+10 minutes (30 minutes in total) before my item interval could change.
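
The worst case described here is simply the sum of the refresh intervals along the path from server to agent. A tiny Python sketch of that arithmetic, using the figures from this comment (the function name is made up for illustration):

```python
def worst_case_delay_s(refresh_intervals_s):
    """Upper bound on propagation delay: every cache in the chain has just
    refreshed, so the change waits one full interval at each hop."""
    return sum(refresh_intervals_s)


# Default install: only the server's configuration syncer, every 60 s.
default_install = worst_case_delay_s([60])

# Environment from the comment: server, proxy and agent each refresh
# every 10 minutes (600 s).
with_proxy_and_active_agent = worst_case_delay_s([600, 600, 600])

print(default_install // 60, "minute(s)")
print(with_proxy_and_active_agent // 60, "minute(s)")
```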

Comment by Bjarne Offenberg [ 2015 Aug 27 ]

I see that this feature request is already 3 years old, but has not yet been implemented.
I found this through this forum thread: https://www.zabbix.com/forum/showthread.php?t=14113, which links to ZBXNEXT-1475.

I see that web-monitoring has implemented retries.
https://www.zabbix.com/documentation/2.2/manual/web_monitoring

Can the same technique that is used for web_monitoring be used for simple checks?

Comment by Luca [ 2015 Aug 27 ]

Seconding the request for a re-check interval feature.
If anyone is also looking for a way to bypass the first-failure trigger, see https://www.zabbix.com/forum/showpost.php?s=a2cf4ca16769d1443c0c78d862dd1c5f&p=51870&postcount=2

Comment by Alex Hart [ 2016 Jun 27 ]

Nagios also has this (retry_check_interval). Details: http://serverfault.com/questions/329125/nagios-check-service-frequency-based-on-service-status

Comment by Horacio [ 2016 Oct 19 ]

Related

Comment by Jake Hill [ 2017 May 04 ]

Very interested in this feature. It would dramatically improve the quality of our open alerts, both inside Zabbix and when brokered to services like PagerDuty.

+1

Comment by Glebs Ivanovskis (Inactive) [ 2017 Jun 19 ]

Once we have ZBXNEXT-1675, ZBXNEXT-3704 offers a more general approach to achieve the same result.

Comment by richlv [ 2017 Sep 15 ]

A hack using the API has been described at http://www.zabbixbook.com/2016/10/24/using-the-zabbixtiny-to-change-an-item-interval-based-on-a-trigger/

Comment by Alexei Vladishev [ 2019 Mar 21 ]

I believe this is quite a good use case for the throttling functionality, which was introduced in 4.2.

Comment by Mickael Martin [ 2022 May 03 ]

Hello,

Any update?

Comment by LivreAcesso.Pro [ 2023 Nov 02 ]

See ZBXNEXT-8609
