[ZBX-16970] Random crash server Created: 2019 Nov 21 Updated: 2019 Nov 22 Resolved: 2019 Nov 21 |
|
Status: | Closed |
Project: | ZABBIX BUGS AND ISSUES |
Component/s: | None |
Affects Version/s: | None |
Fix Version/s: | None |
Type: | Incident report | Priority: | Critical |
Reporter: | Outscale Monitoring Team | Assignee: | Zabbix Support Team |
Resolution: | Commercial support required | Votes: | 0 |
Labels: | None | ||
Remaining Estimate: | Not Specified | ||
Time Spent: | Not Specified | ||
Original Estimate: | Not Specified |
Attachments: | image-2019-11-21-13-37-59-934.png logs-ZabbixServer.txt |
Description |
Hi, Since our migration to 4.2.4, our Zabbix Server crash randomly. Do you have any explanations? You will find our logs file attached. Thanks you by advance.
|
Comments |
Comment by Outscale Monitoring Team [ 2019 Nov 21 ] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Hello again, Seems linked to the escalator process as we have already met this issue 3 times since September. For now it happened in the middle of the night with no actions at all , and also while shortly after adding new actions ( triggers and autoreg ). Disabling the action and cleaning the escalations table permitted us to restart the server. We never had any issue of this kind before upgrading from 3.4.8 to 4.2.4 DB is mysql, Server in 4.2.4 , Server with 7k hosts, 22 k nvps. Not so sure if we should upgrade and if it would help in anything at this point but we keep our investigations.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comment by Vladislavs Sokurenko [ 2019 Nov 21 ] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Could you please show output of show create table actions and show create table operations ? | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comment by Outscale Monitoring Team [ 2019 Nov 21 ] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Thanks for your answer Vladislavs. FYI, i just restarted my server by truncating escalations again.
Here is the show create table actions :
mysql> show create table actions;
--------
--------
mysql> show create table operations;
-----------
-----------
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comment by Outscale Monitoring Team [ 2019 Nov 21 ] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Escalations table looking like that just before being truncated.
mysql> select * from escalations;
-------------
------------- | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comment by Vladislavs Sokurenko [ 2019 Nov 21 ] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
backtrace for easier searching: 582:20191121:050546.428 Got signal [signal:11(SIGSEGV),reason:1,refaddr:(nil)]. Crashing ... 582:20191121:050546.428 ====== Fatal information: ====== 582:20191121:050546.428 Program counter: 0x7f6873f9bd67 582:20191121:050546.428 === Registers: === 582:20191121:050546.428 r8 = 7f687431f060 = 140086602756192 = 140086602756192 582:20191121:050546.428 r9 = 1 = 1 = 1 582:20191121:050546.428 r10 = 2 = 2 = 2 582:20191121:050546.429 r11 = 0 = 0 = 0 582:20191121:050546.429 r12 = 7f6877be5565 = 140086662288741 = 140086662288741 582:20191121:050546.429 r13 = 0 = 0 = 0 582:20191121:050546.429 r14 = 0 = 0 = 0 582:20191121:050546.429 r15 = 0 = 0 = 0 582:20191121:050546.429 rdi = 0 = 0 = 0 582:20191121:050546.429 rsi = 0 = 0 = 0 582:20191121:050546.429 rbp = 0 = 0 = 0 582:20191121:050546.429 rbx = 9 = 9 = 9 582:20191121:050546.429 rdx = a = 10 = 10 582:20191121:050546.429 rax = 7f687431c260 = 140086602744416 = 140086602744416 582:20191121:050546.429 rcx = 0 = 0 = 0 582:20191121:050546.429 rsp = 7ffe6e0a5a70 = 140730744593008 = 140730744593008 582:20191121:050546.429 rip = 7f6873f9bd67 = 140086599073127 = 140086599073127 582:20191121:050546.429 efl = 10283 = 66179 = 66179 582:20191121:050546.429 csgsfs = 33 = 51 = 51 582:20191121:050546.429 err = 4 = 4 = 4 582:20191121:050546.429 trapno = e = 14 = 14 582:20191121:050546.429 oldmask = 0 = 0 = 0 582:20191121:050546.429 cr2 = 0 = 0 = 0 582:20191121:050546.429 === Backtrace: === 597:20191121:050546.429 server #64 started poller #6 582:20191121:050546.429 15: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](zbx_backtrace+0x42) [0x7f6877b29f1a] 582:20191121:050546.429 14: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](zbx_log_fatal_info+0x151) [0x7f6877b2a198] 582:20191121:050546.429 13: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](+0x19b532) [0x7f6877b2a532] 582:20191121:050546.429 12: /lib64/libpthread.so.0(+0xf5e0) [0x7f687705e5e0] 582:20191121:050546.429 11: /lib64/libc.so.6(+0x39d67) [0x7f6873f9bd67] 582:20191121:050546.429 10: /lib64/libc.so.6(atoi+0x10) [0x7f6873f98770] 582:20191121:050546.429 9: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](get_db_actions_info+0x2f7) [0x7f6877ab4363] 582:20191121:050546.429 8: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](+0x80551) [0x7f6877a0f551] 582:20191121:050546.429 7: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](+0x8155c) [0x7f6877a1055c] 582:20191121:050546.429 6: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](escalator_thread+0x1f3) [0x7f6877a107c2] 582:20191121:050546.429 5: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](zbx_thread_start+0x37) [0x7f6877b37888] 582:20191121:050546.429 4: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](MAIN_ZABBIX_ENTRY+0xcae) [0x7f68779cf762] 582:20191121:050546.429 3: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](daemon_start+0x2f6) [0x7f6877b2975e] 582:20191121:050546.429 2: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](main+0x312) [0x7f68779ceab2] 582:20191121:050546.429 1: /lib64/libc.so.6(__libc_start_main+0xf5) [0x7f6873f83c05] 582:20191121:050546.429 0: /usr/sbin/zabbix_server: escalator #9 [processed 0 escalations in 0.000000 sec, processing escalations](+0x3eb79) [0x7f68779cdb79] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comment by Vladislavs Sokurenko [ 2019 Nov 21 ] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
This is incorrect
`pause_suppressed` int(11) DEFAULT NULL,
Must be `pause_suppressed` int(11) NOT NULL DEFAULT '1', No indication of a bug, closing. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comment by Outscale Monitoring Team [ 2019 Nov 21 ] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Excellent ! We will fix our actions and give you a feedback. Thanks a lot Vladislavs. |