[ZBX-3301] Zabbix Server crashed Created: 2010 Dec 17  Updated: 2017 May 30  Resolved: 2010 Dec 20

Status: Closed
Project: ZABBIX BUGS AND ISSUES
Component/s: Server (S)
Affects Version/s: 1.8.4rc2
Fix Version/s: 1.8.4

Type: Incident report Priority: Minor
Reporter: sYBEx Assignee: Unassigned
Resolution: Fixed Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

RHEL5, 8GB RAM, MySQL, ...



 Description   

I have running this Version since 2 Weeks ... added no new hosts etc.
Yesterday it crashed and zabbix server was not running anymore ...

Here is the logfile output:

3354:20101216:234610.699 Got signal [signal:11(SIGSEGV),reason:1,refaddr:(nil)]. Crashing ...
3354:20101216:234610.714 ====== Fatal information: ======
3354:20101216:234610.714 Program counter: 0x3f2627a1db
3354:20101216:234610.714 === Registers: ===
3354:20101216:234610.714 r8 = 70 = 112 = 112
3354:20101216:234610.714 r9 = 0 = 0 = 0
3354:20101216:234610.714 r10 = 4000 = 16384 = 16384
3354:20101216:234610.714 r11 = 246 = 582 = 582
3354:20101216:234610.714 r12 = 0 = 0 = 0
3354:20101216:234610.714 r13 = 7fffca298f23 = 140736585109283 = 140736585109283
3354:20101216:234610.714 r14 = 7fffca2d13a0 = 140736585339808 = 140736585339808
3354:20101216:234610.714 r15 = 7370640 = 121046592 = 121046592
3354:20101216:234610.714 rdi = 0 = 0 = 0
3354:20101216:234610.714 rsi = 475cc3 = 4676803 = 4676803
3354:20101216:234610.714 rbp = 7fffca298db0 = 140736585108912 = 140736585108912
3354:20101216:234610.714 rbx = 0 = 0 = 0
3354:20101216:234610.714 rdx = 2b2878d2894b = 47452825749835 = 47452825749835
3354:20101216:234610.714 rax = 70 = 112 = 112
3354:20101216:234610.714 rcx = ffffffffffffffff = 18446744073709551615 = -1
3354:20101216:234610.714 rsp = 7fffca270f58 = 140736584945496 = 140736584945496
3354:20101216:234610.714 rip = 3f2627a1db = 271223071195 = 271223071195
3354:20101216:234610.714 efl = 10257 = 66135 = 66135
3354:20101216:234610.714 csgsfs = 33 = 51 = 51
3354:20101216:234610.714 err = 4 = 4 = 4
3354:20101216:234610.714 trapno = e = 14 = 14
3354:20101216:234610.714 oldmask = 0 = 0 = 0
3354:20101216:234610.714 cr2 = 0 = 0 = 0
3354:20101216:234610.714 === Backtrace: ===
3365:20101216:234610.733 Sending list of active checks to [192.168.150.10] failed: host [sposrhdbprim] not found
3354:20101216:234610.855 9: /usr/local/sbin/zabbix_server(print_fatal_info+0xdb) [0x43bacb]
3354:20101216:234610.855 8: /usr/local/sbin/zabbix_server(child_signal_handler+0xeb) [0x43b35b]
3354:20101216:234610.855 7: /lib64/libc.so.6 [0x3f262302d0]
3354:20101216:234610.855 6: /lib64/libc.so.6(strstr+0x1b) [0x3f2627a1db]
3354:20101216:234610.855 5: /usr/local/sbin/zabbix_server(get_value_ssh+0x2fd) [0x419bcd]
3354:20101216:234610.855 4: /usr/local/sbin/zabbix_server(main_poller_loop+0xc51) [0x417381]
3354:20101216:234610.855 3: /usr/local/sbin/zabbix_server(MAIN_ZABBIX_ENTRY+0x532) [0x40fdd2]
3354:20101216:234610.855 2: /usr/local/sbin/zabbix_server(daemon_start+0x20e) [0x43b13e]
3354:20101216:234610.855 1: /lib64/libc.so.6(__libc_start_main+0xf4) [0x3f2621d994]
3354:20101216:234610.855 0: /usr/local/sbin/zabbix_server [0x40c409]
.......
3354:20101216:234610.858 ================================
3354:20101216:234610.858 Please consider attaching a disassembly listing to your bug report.
3354:20101216:234610.858 This listing can be produced with, e.g., objdump -D -S zabbix_server.
3354:20101216:234610.858 ================================
3179:20101216:234610.953 One child process died (PID:3354,exitcode/signal:255). Exiting ...
3179:20101216:234612.956 Syncing history data...
3179:20101216:234612.991 Syncing history data... done.
3179:20101216:234612.991 Syncing trends data...
3179:20101216:234613.086 Syncing trends data... done.
3179:20101216:234613.086 Zabbix Server stopped. Zabbix 1.8.4rc2 (revision 15201).

Maybe there is an error. Never had this in other versions before.

Workaround would be to restart the zabbix_server.



 Comments   
Comment by richlv [ 2010 Dec 17 ]

while you still have that binary, could you please add objdump output ? (make sure to redirect it to a file & compress heavily )

Comment by Aleksandrs Saveljevs [ 2010 Dec 20 ]

The server seems to have crashed because libssh2_userauth_list() returned NULL (see http://www.libssh2.org/libssh2_userauth_list.html for a list of available reasons) and the previous code assumed a non-NULL value. The assumption has been fixed in development branch svn://svn.zabbix.com/branches/dev/ZBX-3301.

Comment by Aleksandrs Saveljevs [ 2010 Dec 20 ]

Merged into pre-1.8.4 in r16258.

Generated at Sat Apr 20 16:10:34 EEST 2024 using Jira 9.12.4#9120004-sha1:625303b708afdb767e17cb2838290c41888e9ff0.