-
Incident report
-
Resolution: Won't fix
-
Trivial
-
None
-
3.0.0alpha5
-
None
-
Debian Linux, Virtualized on KVM, MySQL DB
I have noticed some alerts are not being received with a larger number of Escalator processes. With a value of 1 I have been unable to replicate any issue (yet). I have also been unable to replicate any issue without event recovery emails enabled (but I have not extensively tested this).
All escalator processes are running:
# ps aux | grep zabbix | grep escal zabbix 980 0.0 0.1 728040 5632 ? S 2015 1:50 /usr/sbin/zabbix_server: escalator #1 [processed 0 escalations in 0.000292 sec, idle 3 sec] zabbix 981 0.0 0.1 728040 5208 ? S 2015 1:46 /usr/sbin/zabbix_server: escalator #2 [processed 0 escalations in 0.000262 sec, idle 3 sec] zabbix 982 0.0 0.1 728040 5388 ? S 2015 1:47 /usr/sbin/zabbix_server: escalator #3 [processed 0 escalations in 0.000275 sec, idle 3 sec] zabbix 984 0.0 0.1 728040 5144 ? S 2015 1:51 /usr/sbin/zabbix_server: escalator #4 [processed 0 escalations in 0.000267 sec, idle 3 sec] zabbix 985 0.0 0.1 728040 5700 ? S 2015 1:47 /usr/sbin/zabbix_server: escalator #5 [processed 0 escalations in 0.000367 sec, idle 3 sec] zabbix 986 0.0 0.1 728040 5288 ? S 2015 1:51 /usr/sbin/zabbix_server: escalator #6 [processed 0 escalations in 0.000520 sec, idle 3 sec] zabbix 987 0.0 0.1 728040 5328 ? S 2015 1:47 /usr/sbin/zabbix_server: escalator #7 [processed 0 escalations in 0.000516 sec, idle 3 sec] zabbix 1000 0.0 0.1 728040 5372 ? S 2015 1:47 /usr/sbin/zabbix_server: escalator #8 [processed 0 escalations in 0.000378 sec, idle 3 sec] zabbix 1002 0.0 0.1 728040 5736 ? S 2015 1:51 /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000582 sec, idle 3 sec] zabbix 1004 0.0 0.1 728040 5280 ? S 2015 1:48 /usr/sbin/zabbix_server: escalator #10 [processed 0 escalations in 0.000343 sec, idle 3 sec] zabbix 1005 0.0 0.1 728040 5112 ? S 2015 1:49 /usr/sbin/zabbix_server: escalator #11 [processed 0 escalations in 0.000315 sec, idle 3 sec] zabbix 1006 0.0 0.1 728040 5196 ? S 2015 1:46 /usr/sbin/zabbix_server: escalator #12 [processed 0 escalations in 0.001168 sec, idle 3 sec] zabbix 1007 0.0 0.1 728040 5036 ? S 2015 1:46 /usr/sbin/zabbix_server: escalator #13 [processed 0 escalations in 0.000387 sec, idle 3 sec] zabbix 1015 0.0 0.1 728040 5352 ? S 2015 1:45 /usr/sbin/zabbix_server: escalator #14 [processed 0 escalations in 0.000384 sec, idle 3 sec] zabbix 1017 0.0 0.1 728040 5860 ? S 2015 1:54 /usr/sbin/zabbix_server: escalator #15 [processed 0 escalations in 0.000401 sec, idle 3 sec] zabbix 1018 0.0 0.1 728040 5768 ? S 2015 1:49 /usr/sbin/zabbix_server: escalator #16 [processed 0 escalations in 0.000187 sec, idle 3 sec] zabbix 1019 0.0 0.1 728040 5384 ? S 2015 1:48 /usr/sbin/zabbix_server: escalator #17 [processed 0 escalations in 0.000264 sec, idle 3 sec] zabbix 1020 0.0 0.1 728040 5448 ? S 2015 1:48 /usr/sbin/zabbix_server: escalator #18 [processed 0 escalations in 0.000260 sec, idle 3 sec] zabbix 1021 0.0 0.1 728040 5344 ? S 2015 1:51 /usr/sbin/zabbix_server: escalator #19 [processed 0 escalations in 0.000694 sec, idle 3 sec] zabbix 1022 0.0 0.1 728040 5332 ? S 2015 1:46 /usr/sbin/zabbix_server: escalator #20 [processed 0 escalations in 0.000446 sec, idle 3 sec] zabbix 1023 0.0 0.1 728040 5164 ? S 2015 1:47 /usr/sbin/zabbix_server: escalator #21 [processed 0 escalations in 0.000513 sec, idle 3 sec] zabbix 1024 0.0 0.1 728040 5340 ? S 2015 1:49 /usr/sbin/zabbix_server: escalator #22 [processed 0 escalations in 0.000214 sec, idle 3 sec] zabbix 1025 0.0 0.1 728040 5224 ? S 2015 1:50 /usr/sbin/zabbix_server: escalator #23 [processed 0 escalations in 0.000170 sec, idle 3 sec] zabbix 1026 0.0 0.1 728040 5512 ? S 2015 1:47 /usr/sbin/zabbix_server: escalator #24 [processed 0 escalations in 0.000215 sec, idle 3 sec] zabbix 1027 0.0 0.1 728040 5664 ? S 2015 1:47 /usr/sbin/zabbix_server: escalator #25 [processed 0 escalations in 0.000376 sec, idle 3 sec]
Sometimes it is just the PROBLEM email, othertimes only a recovery is sent.
Screenshots attached:
1. Event example
2. Email example
3. Action configuration
In the email screenshot every email should have a count of two as the threads are grouped by Subject Event ID, and the Recovery email makes it 2 per.
Let me know if there is any troubleshooting you need me to perform. I will keep looking for the issue myself, and see if I can confirm the issue factors.