[ZBX-13598] Zabbix Crash - Unreachable Poller Created: 2018 Mar 09  Updated: 2024 Apr 10  Resolved: 2018 Apr 06

Status: Closed
Project: ZABBIX BUGS AND ISSUES
Component/s: Server (S)
Affects Version/s: 3.4.7
Fix Version/s: 3.4.9rc1, 4.0.0alpha6, 4.0 (plan)

Type: Incident report Priority: Major
Reporter: Leandro Rocha Assignee: Viktors Tjarve
Resolution: Fixed Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

Ubuntu 16.04.4 LTS with mysql Ver 14.14 Distrib 5.7.21, for Linux (x86_64) using EditLine wrapper


Attachments: File ZBX-13598.diff     Text File zabbix-crash1.txt     Text File zabbix-crash2.txt     Text File zabbix-crash3.txt    
Team: Team A
Sprint: Sprint 30, Sprint 31
Story Points: 0.25

 Description   

My Zabbix start to crash random, sometime is one time per day, others is five time per day...
I put the log in the attachment, if you need a high debug level please tell me.



 Comments   
Comment by Vladislavs Sokurenko [ 2018 Mar 09 ]

Backtrace for easier searching

======= Backtrace: =========
/lib/x86_64-linux-gnu/libc.so.6(+0x777e5)[0x7f6fffca87e5]
/lib/x86_64-linux-gnu/libc.so.6(+0x8037a)[0x7f6fffcb137a]
/lib/x86_64-linux-gnu/libc.so.6(cfree+0x4c)[0x7f6fffcb553c]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](zbx_dvsprintf+0xcd)[0x4b016d]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](__zbx_zbx_dsprintf+0x94)[0x4b0234]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](zbx_ipc_socket_open+0x1f6)[0x4be1e6]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values][0x45e556]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](zbx_preprocessor_flush+0x27)[0x45f247]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values][0x430edb]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](poller_thread+0xf8)[0x431528]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](zbx_thread_start+0x45)[0x4a7635]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](MAIN_ZABBIX_ENTRY+0x789)[0x424309]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](daemon_start+0x1bf)[0x49dcef]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](main+0x3bf)[0x41b95f]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf0)[0x7f6fffc51830]
/usr/local/sbin/zabbix_server: unreachable poller #6 [got 0 values in 0.000091 sec, getting values](_start+0x29)[0x41bc89]
Comment by Vladislavs Sokurenko [ 2018 Mar 09 ]

If possible, then please try patch attached, thank you for your report!

Comment by Leandro Rocha [ 2018 Mar 09 ]

Sorry for asking but, how i patch ?

Comment by Vladislavs Sokurenko [ 2018 Mar 09 ]

This should do the trick
patch -p0 -i ZBX-13598.diff

How many pollers do you have ? It looks like new poller cannot create any more sockets.

Comment by Leandro Rocha [ 2018 Mar 09 ]

Pollers:

### Option: StartPollers
# StartPollers=5
### Option: StartIPMIPollers
StartIPMIPollers=0
### Option: StartPollersUnreachable
StartPollersUnreachable=10
### Option: StartHTTPPollers
# StartHTTPPollers=1
### Option: StartJavaPollers
# StartJavaPollers=0
### Option: StartProxyPollers
StartProxyPollers=3

Another crash ( still dont patch ), i put a new attachment.

Comment by Leandro Rocha [ 2018 Mar 09 ]

Patch applied.

If i get another crash i will put here.

Comment by Leandro Rocha [ 2018 Mar 10 ]

Another "crash" after the patch.

Comment by Vladislavs Sokurenko [ 2018 Mar 10 ]

Yes, now it exits intentionally, with an error: what is your SocketDir ?

 16997:20180310:031416.049 cannot connect to preprocessing service: Cannot connect to service "preprocessing": [2] No such file or directory.
 16969:20180310:031416.133 One child process died (PID:16997,exitcode/signal:1). Exiting ...
 16969:20180310:031418.180 syncing history data...
 16969:20180310:031418.184 syncing history data done
 16969:20180310:031418.184 syncing trend data...
 16969:20180310:031420.259 syncing trend data done
 16969:20180310:031420.259 Zabbix Server stopped. Zabbix 3.4.7 (revision 77720).
Comment by Leandro Rocha [ 2018 Mar 10 ]

Sorry for ask, but what is a "SocketDir" ?

Comment by Vladislavs Sokurenko [ 2018 Mar 10 ]

https://www.zabbix.com/documentation/3.4/manual/appendix/config/zabbix_server
Directory to store IPC sockets used by internal Zabbix services.

Does setting it to something other than /tmp help ?

Comment by Leandro Rocha [ 2018 Mar 10 ]

No.

Comment by Vladislavs Sokurenko [ 2018 Mar 10 ]

It looks like some other process deleted socket that zabbix server created, that's why Zabbix server exits.

Comment by Leandro Rocha [ 2018 Mar 15 ]

Vladislavs you are correct.

I have a crontab schedule to clear the /tmp for others reasons.

Now i put some exception and four days without a crash.

Thank you for the help.

Comment by Viktors Tjarve [ 2018 Apr 03 ]

Fixed in development branch svn://svn.zabbix.com/branches/dev/ZBX-13598

Comment by Andris Zeila [ 2018 Apr 04 ]

Successfully tested

Comment by Viktors Tjarve [ 2018 Apr 04 ]

Released in:

  • 3.4.9rc1 r79295, r79386
  • 4.0.0alpha6 r79297, r79388
Generated at Fri Apr 26 20:58:33 EEST 2024 using Jira 9.12.4#9120004-sha1:625303b708afdb767e17cb2838290c41888e9ff0.