-
New Feature Request
-
Resolution: Fixed
-
Trivial
-
None
-
None
-
Sprint 77 (Jun 2021), Sprint 78 (Jul 2021), Sprint 79 (Aug 2021), Sprint 80 (Sep 2021), Sprint 81 (Oct 2021), Sprint 82 (Nov 2021), Sprint 83 (Dec 2021), Sprint 84 (Jan 2022), Sprint 85 (Feb 2022), Sprint 86 (Mar 2022)
-
10
Summary
Current implementation of services was designed long time ago, it is rather limited and has significant limitations. Major limitations are:
- scalability: it does not scale well, also bad user experience when number of services exceeds few hundreds
- hard to configure: mapping based on triggers does not work well and requires significant effort
- very limited visualization: no graphical representation of services, no good reporting
- limited SLA calculation rules
- no alerting in case of service status changes
- user permissions: all or nothing
Proposed functionality will introduce alerting for services.
Acceptance
Current implementation of Services must be extended to support alerting for service status changes:
- Each service status change will generate a new event (new event source EVENT_SOURCE_SERVICES)
- Zabbix will be extended to allow alerting on the service events similarly to existing functionality of trigger-based actions
- Normal, recovery and update operations must be supported
- Escalation options must be supported
- The following conditions will be supported:
- Service tag name
- Service tag value
- Service status:
- Condition: equals, does not equal, is greater than or equal, is less than or equal
- Service status: OK, Not classified, Information, Warning, Average, High, Disaster
- Service name (equals or contains)
- Service (multi-select of services)
- A new set of macros must be introduced:
-
- {SERVICE.NAME}
: service name
- {SERVICE.STATUS}
: service status in text
- {SERVICE.NSTATUS}
: service status numeric
- {SERVICE.TAGS}
: a comma separated list of service tags
- {SERVICE.TAGSJSON}
: a JSON array containing service tags
- {SERVICE.TAGS.<tag name>}
: tag value
- {SERVICE.ROOTCAUSE}
: list of problem events that caused service to fail sorted by severity and host name (host name, event name, severity, age, tags)
- Only problems having real impact on the service must be in the listÂ
- No other macros will be supported to keep this functionality "disconnected" from Zabbix core functionality
- {SERVICE.NAME}
- Media type's message template must be extended with new message types "Service", "Service recovery" and "Service update"
- Only users having at least read access to services will be notified
- caused by
-
ZBXNEXT-2253 [PATCH] IT services actions support
- Closed
- causes
-
ZBX-21021 Service actions are not executing scripts
- Closed