[ZBX-6303] deadlocks in ORA-00060 in new zabbix 2.0.5 with Oracle g11r2 RAC Created: 2013 Feb 22 Updated: 2017 May 30 Resolved: 2013 Mar 28 |
|
Status: | Closed |
Project: | ZABBIX BUGS AND ISSUES |
Component/s: | Server (S) |
Affects Version/s: | 2.0.5 |
Fix Version/s: | None |
Type: | Incident report | Priority: | Blocker |
Reporter: | Olgierd Wolodkiewicz | Assignee: | Unassigned |
Resolution: | Duplicate | Votes: | 2 |
Labels: | crash, oracle | ||
Remaining Estimate: | Not Specified | ||
Time Spent: | Not Specified | ||
Original Estimate: | Not Specified | ||
Environment: |
RHEL + 2 node Oracle g11R2 RAC |
Attachments: |
![]() ![]() ![]() ![]() |
||||||||
Issue Links: |
|
Description |
After upgrading to new 2.0.5 version we have: which causes to grow usage of cache write buffer and eventually to crash zabbix_server after a while (1-4h). Fix |
Comments |
Comment by Andris Zeila [ 2013 Feb 27 ] |
Can you attach the oracle trace file? There should be more information about resources/transactions involved. I don't think setting prefetch buffer would cause deadlocks (unless you have tested 2.05 version with removed Actually after looking more at the quoted errors the messages seem to be truncated. For Z3005 errors the failed sql statement should be printed after error message in []. Is it possible to attach the original log (or create a new one with DebugLevel 4 if possible). |
Comment by Ryan Rupp [ 2013 Mar 23 ] |
I'm seeing this as well on 2.0.5 (although I haven't tested against earlier versions) when batch updates to the items table are made, I've attached a snippet of the log. I didn't see anything else really relevant but if the full trace is useful let me know. I don't have any logs currently from the DB side. |
Comment by Andris Zeila [ 2013 Mar 25 ] |
If you could get full trace with DebugLevel 4 it might be useful. There is also possibility of similar deadlocks happening between frontend and server ( |
Comment by zabbixforme [ 2013 Mar 26 ] |
Hi. I have same problem on Oracle 11.2.0.3 and zabbix 2.0.5. Server crash log attached. |
Comment by zabbixforme [ 2013 Mar 26 ] |
Oracle trace file attached |
Comment by zabbixforme [ 2013 Mar 26 ] |
Second Oracle trace file attached. |
Comment by Andris Zeila [ 2013 Mar 26 ] |
Thanks for the logs. Apparently there were circular deadlock between history and lld updates. It should be fixed with lld item update optimizations in 2.2 ( Regarding the crash - there are no DBget_seq_maxid() functions in zabbix server sources, so my guess that was some third party patch to improve ID number generation on oracle (which again should be improved if not fixed in 2.2). |
Comment by zabbixforme [ 2013 Mar 26 ] |
Thanks for reply Andris Zeila. I have examined the code in the link ( Yes, we replaced DBget_seq_maxid() and use logic "select eventid_sequence.nextval from dual" to improve EVENT ID number. |
Comment by Andris Zeila [ 2013 Mar 27 ] |
That's true, but it should also help with lld deadlocks. Still we found another potential source of lld related deadlocks, so reopening to fix it. |
Comment by Olgierd Wolodkiewicz [ 2013 Mar 27 ] |
I have no access to DB logs - I can't provide this. |
Comment by Alexander Vladishev [ 2013 Mar 28 ] |
Fixed under |