Since DB failover was executed without causing Zabbix server to go down in Zabbix 5.0, we anticipate consistent behavior as we had prior to Zabbix 5.0.
The Zabbix server working with redundant DB was stopped during DB failover process.
During the failover process, The DB returns an error because it cannot process queries.
It seems that the failed query was not retried and causes HA manager to stop and Zabbix server to go down.
quoted from zabbix_server.log:
When a DB fails over in an Act/Standby configuration, there may be a time when the DB cannot be updated.
So if HA manager stops due to an error like above, it means that the Act/Standby DB configuration is not available.
I think we need to make sure that HA manager does not stop in such situations, or provide a documentation to explain it to our customer.
Can you please improve this error handling?
Please see attached log file as well.