[ZBXNEXT-6999] Advanced SLA reporting Created: 2021 Oct 13  Updated: 2024 Apr 10  Resolved: 2022 Jun 15

Status: Closed
Project: ZABBIX FEATURE REQUESTS
Component/s: API (A), Frontend (F)
Affects Version/s: None
Fix Version/s: 6.0.0beta2, 6.0 (plan)

Type: Change Request
Priority: Major
Reporter: Alexei Vladishev
Assignee: Andrejs Verza
Resolution: Fixed
Votes: 4
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: PDF File DEV-78267307-131021-1802-120.pdf     GIF File Enabled_Disabled_SLA_status.gif     GIF File Missing_error_widget_periods.gif     GIF File SLA_from_child_to_parent.gif     PNG File Screenshot from 2021-12-14 17-49-49.png     PNG File Screenshot from 2021-12-15 13-59-53.png     PNG File Screenshot from 2021-12-15 18-14-46.png     PNG File Screenshot from 2021-12-15 18-19-22.png     PNG File Screenshot from 2021-12-17 14-13-17.png     PNG File Screenshot from 2021-12-17 18-40-38.png     PNG File Screenshot from 2021-12-20 15-34-58.png     PNG File Screenshot from 2021-12-22 17-15-58.png     PNG File Screenshot from 2022-01-04 16-29-03.png     PNG File Screenshot from 2022-01-05 16-15-50.png     GIF File counter_and_actions_in_service_lists.gif     GIF File cross-section-filter.gif     GIF File cross-view-counter.gif     GIF File eseting_the_filter.gif     GIF File filter_uncollapses.gif     GIF File icon_dissapears.gif     GIF File incorrect_minutes_dropdown_value.gif     GIF File incorrect_stats_in_widget.gif     GIF File jumpy_link_edge.gif     GIF File lost_dates_in_pagination.gif     GIF File missing_alert.gif     GIF File missing_periods_after_migration.gif     GIF File missing_system_default.gif     GIF File no_hover_over_icon.gif     GIF File non_related_sla_service.gif     GIF File not_sortable_service_name.gif     GIF File shifting_column_values.gif     GIF File slo_validation_rounding_inconsistency.gif     GIF File strange_filter_behavior.gif     GIF File strange_redirect_after_deletion.gif     GIF File strange_way_of_populating_start_date.gif     GIF File tag_misunderstanding.gif    
Issue Links:
Causes
causes ZBX-21084 Inconsistent Custom schedule default ... Open
causes ZBX-21110 "Failed to parse time string" runtime... Closed
Duplicate
duplicates ZBXNEXT-1673 Possibility to manipulate SLA by addi... Open
is duplicated by ZBXNEXT-1131 IT Services do not indicate start of ... Open
Sub-task
Team: Team A
Sprint: Sprint 81 (Oct 2021), Sprint 82 (Nov 2021), Sprint 83 (Dec 2021), Sprint 84 (Jan 2022), Sprint 85 (Feb 2022), Sprint 86 (Mar 2022), Sprint 87 (Apr 2022), Sprint 88 (May 2022), Sprint 89 (Jun 2022)
Story Points: 19

 Description   

The current implementation of services was designed a long time ago and has significant limitations.

It would be nice to improve SLA reporting by introducing a more flexible SLA definition (timezone, SLO, effective date, excluded downtimes) and SLI reporting that shows all the numbers (SLO, SLI, uptime, downtime, error budget, etc.).

It would also be great to be able to define multiple SLAs per service and to link SLAs to services using tags.
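
To illustrate how the requested numbers relate to each other, here is a minimal sketch in Python. It is not the Zabbix implementation: the function name `sla_report`, the `SlaReport` fields, and the convention that the error budget is the allowed downtime minus the observed downtime are all assumptions made for this example. Downtime is assumed to be measured outside the excluded windows.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SlaReport:
    """Numbers an SLA report could show (hypothetical structure)."""
    slo_percent: float     # target service level, in percent
    uptime_s: float        # seconds the service was up
    downtime_s: float      # seconds the service was down
    sli_percent: float     # achieved service level, in percent
    error_budget_s: float  # downtime still allowed; negative if the SLO is breached


def sla_report(slo_percent, period_s, downtime_s, excluded_s=0.0):
    """Compute SLI and error budget for one reporting period.

    Assumption: excluded downtimes shorten the period that counts,
    and downtime_s only covers time outside the excluded windows.
    """
    scheduled_s = period_s - excluded_s
    if scheduled_s <= 0:
        raise ValueError("reporting period must be longer than the excluded time")
    if not 0 <= downtime_s <= scheduled_s:
        raise ValueError("downtime must fit inside the scheduled time")

    uptime_s = scheduled_s - downtime_s
    sli_percent = 100.0 * uptime_s / scheduled_s
    # Downtime the SLO tolerates over the scheduled time.
    allowed_downtime_s = scheduled_s * (100.0 - slo_percent) / 100.0
    return SlaReport(
        slo_percent=slo_percent,
        uptime_s=uptime_s,
        downtime_s=downtime_s,
        sli_percent=sli_percent,
        error_budget_s=allowed_downtime_s - downtime_s,
    )


if __name__ == "__main__":
    # 30-day period, 99.9% SLO, 1296 seconds of downtime.
    print(sla_report(99.9, 30 * 24 * 3600, 1296))
```

With a 99.9% SLO over 30 days, the tolerated downtime is about 2592 seconds, so 1296 seconds of downtime gives an SLI of roughly 99.95% and leaves about half of the error budget.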



 Comments   
Comment by Alexei Vladishev [ 2021 Oct 13 ]

A high-level design document is attached for everyone interested. The functionality is coming in 6.0; in fact, it is one of the few blockers for this release.

Comment by David Collier [ 2021 Oct 13 ]

Thanks for this Alexei.

I have a question / comment.

The document suggests that the SLIs are going to be based on availability. Would the proposed solution also allow for performance items?

e.g.

SLI 1 : Server responds to ping is an availability metric.

SLI 2 : Ping response < 15ms is a performance metric.

I'm sure it would be very useful if customers - especially MSPs - could differentiate between availability and performance on an SLA report.

Would this be possible using the tag mechanism in the design?

Other than that, it's a very welcome advance.

Comment by Alexei Vladishev [ 2021 Oct 14 ]

NobiusDave, it is absolutely possible. You may define two SLAs: one for performance and another for availability and link them to different sets of problems using tags, tag "Class:Performance" for the first SLA and "Class:Availability" for the second one.

So, in your example, the trigger "Server responds to ping" should be tagged with "Class:Availability", while the trigger "Ping response < 15ms" should be tagged with "Class:Performance".

I hope it makes sense.
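
The tag-based split described above can be sketched as follows. This is only an illustration of the matching idea in plain Python, not Zabbix code or the Zabbix API: the `Problem` type, the `SLA_TAGS` mapping, the SLA names, and the `problems_for_sla` helper are all hypothetical.

```python
from typing import NamedTuple


class Problem(NamedTuple):
    """A problem with its (tag, value) pairs (hypothetical structure)."""
    name: str
    tags: frozenset


# Hypothetical SLA definitions: each SLA selects problems by one tag/value pair.
SLA_TAGS = {
    "Availability SLA": ("Class", "Availability"),
    "Performance SLA": ("Class", "Performance"),
}

PROBLEMS = [
    Problem("Server responds to ping", frozenset({("Class", "Availability")})),
    Problem("Ping response < 15ms", frozenset({("Class", "Performance")})),
]


def problems_for_sla(sla_name, problems):
    """Return the names of the problems whose tags match the given SLA."""
    wanted = SLA_TAGS[sla_name]
    return [p.name for p in problems if wanted in p.tags]


if __name__ == "__main__":
    for sla in SLA_TAGS:
        print(sla, "->", problems_for_sla(sla, PROBLEMS))
```

Because each SLA only looks at its own tag value, the availability and performance figures are computed from disjoint sets of problems and can be reported separately.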

Comment by Andrejs Verza [ 2022 Jan 07 ]

Implemented in 6.0.0beta2 (master) 6d5917ba51b.

Comment by Ognjen sijan [ 2022 Jan 13 ]

One suggestion: a button to export the entire SLA to a CSV file. Other than this, it is looking very nice.

Comment by Janis Freibergs [ 2022 Mar 02 ]

API documentation updated for v6.0:

Comment by Martins Valkovskis [ 2022 Mar 03 ]

Updated user documentation:

Comment by Constantin Oshmyan [ 2022 Apr 01 ]

Unfortunately, the current implementation of Services/SLA (in v6.0) is far from ideal:

  • One of the limitations mentioned in the provided document was:

    very limited visualization: no graphical representation of services, no good reporting

    The most annoying fact is that we even have a regression in this area.
    Some examples of the lost functionality are described in ZBXNEXT-7392 and ZBXNEXT-7600.

  • Another pain point: when the tag-based link between services and triggers was implemented, LLD-generated triggers were forgotten. ZBXNEXT-5467 becomes very important; or, at least, ZBXNEXT-7593 as a workaround (if it is simpler to implement).
  • The API documentation is incomplete and sometimes simply wrong (see ZBX-20784).

I very much hope that attention will be paid to these problems. Some of them are blockers for us, preventing an upgrade from v5.0 to the next LTS version.

Comment by Constantin Oshmyan [ 2022 Oct 10 ]

alexei, we discussed this at the Zabbix Summit 2022 recently, so this is just a reminder.
I understand that this ticket is closed already; however, it contains my previous message with the list of our pain points and links to other tickets we are patiently waiting for.

All these issues (including regressed functionality and incomplete documentation) still exist after half a year, one further release (6.2), and another version (6.4) in the beta stage.
We'll be grateful if these tickets are implemented at least in the next LTS version (7.0).

Comment by Alexei Vladishev [ 2022 Oct 11 ]

constantin.oshmyan, thank you for the discussion at the Summit and all the reminders regarding BSM functionality. I'd like to see it implemented in 7.0 LTS.

Comment by Constantin Oshmyan [ 2023 Oct 10 ]

alexei, just a reminder, as we discussed at the Zabbix Summit 2023.
As I wrote a year ago:

I understand that this ticket is closed already; however, it contains my previous message with the list of our pain points and links to other tickets we are patiently waiting for.

All these issues are still relevant for us. We've upgraded our Zabbix system to the current LTS version (5.0 -> 6.0); however, we still need each of these missing features, especially where functionality existed before but was lost in v6.0.
We are looking forward to having them implemented in the next LTS version (i.e. 7.0).

Generated at Sat Apr 19 03:56:31 EEST 2025 using Jira 9.12.4#9120004-sha1:625303b708afdb767e17cb2838290c41888e9ff0.