Starting Zabbix in single-node HA causes history cache to fill up and queues to grow "indefinitely"


    • Type: Incident report
    • Resolution: Unresolved
    • Priority: Trivial
    • Affects Version/s: None
    • Component/s: None
    • Environment:

      Steps to reproduce:

      1. Set HANodeName to <hostname> and NodeAddress to <host_fqdn> on a single Zabbix server host.
      2. Restart Zabbix.
      3. Increase HistoryCacheSize and HistoryIndexCacheSize.
      4. Restart Zabbix again.
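
To double-check which of the settings from the steps above are actually in effect, a configuration file can be parsed with a few lines of Python. This is only a minimal sketch: the helper names (`parse_conf`, `size_to_bytes`, `ha_enabled`) are hypothetical, the K/M/G suffixes are assumed to be binary multiples, and a non-empty HANodeName is assumed to be what switches the server into HA mode.

```python
# Hypothetical helpers for reading "Key=Value" lines in the style of zabbix_server.conf.
SIZE_SUFFIXES = {"K": 1024, "M": 1024 ** 2, "G": 1024 ** 3}  # assumed binary multiples


def parse_conf(text):
    """Return a dict of settings, skipping blank lines and '#' comments."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip()
    return settings


def size_to_bytes(value):
    """Convert a size such as '128M' or '2G' to bytes."""
    suffix = value[-1].upper()
    if suffix in SIZE_SUFFIXES:
        return int(value[:-1]) * SIZE_SUFFIXES[suffix]
    return int(value)


def ha_enabled(settings):
    """Assumption: a non-empty HANodeName means HA mode is on."""
    return bool(settings.get("HANodeName"))


# Values taken from the first restart described in this report.
EXAMPLE = """\
HANodeName=myzabbixhost
NodeAddress=myzabbixhost.mydomain.com
HistoryCacheSize=128M
HistoryIndexCacheSize=64M
"""

settings = parse_conf(EXAMPLE)
print("HA enabled:", ha_enabled(settings))
print("HistoryCacheSize bytes:", size_to_bytes(settings["HistoryCacheSize"]))
```

In practice the text would be read from the server's configuration file instead of the inline `EXAMPLE` string.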

       

      Result:

      The history cache fills up and, once it is full, the queues grow rapidly until Zabbix is shut down again. A few values are received, but the queues show no sign of being processed as one would expect.

      When HistoryCacheSize is increased, Zabbix functions normally (i.e. queues are low and values are processed) until the history cache fills up again, after which the queues start to grow.

      After disabling HA, the history cache empties swiftly, the queues are processed quickly, and Zabbix returns to a stable state.

       

      Attached are four sets of graphs showing the caches and the queue for each of the four times Zabbix was restarted with different settings.

       

      First restart

      This enabled HA by setting:

      HANodeName=myzabbixhost
      NodeAddress=myzabbixhost.mydomain.com

       

      Cache settings:

      HistoryCacheSize=128M
      HistoryIndexCacheSize=64M

       

      Timestamps:

      20250610:121911.986 Got signal [signal:15(SIGTERM),sender_pid:4035515,sender_uid:0,reason:0]. Exiting ...
      20250610:121926.055 Zabbix Server stopped.
      20250610:121926.113 Starting Zabbix Server.



      Second restart (HA enabled)

      Cache settings:

      HistoryCacheSize=512M
      HistoryIndexCacheSize=128M

      Timestamps:

      20250610:134911.167 Got signal [signal:15(SIGTERM),sender_pid:298268,sender_uid:0,reason:0]. Exiting ...
      20250610:140538.395 Starting Zabbix Server.

      Third restart (HA enabled)

      Cache settings:

      HistoryCacheSize=1G
      HistoryIndexCacheSize=512M

      Timestamps:

      20250610:143652.439 Got signal [signal:15(SIGTERM),sender_pid:490534,sender_uid:0,reason:0]. Exiting ...
      20250610:144432.535 Zabbix Server stopped.
      20250610:144432.574 Starting Zabbix Server.



      Fourth restart (HA disabled)

      Cache settings:

      HistoryCacheSize=2G
      HistoryIndexCacheSize=512M

       

      Timestamps:

      20250610:153343.619 Got signal [signal:15(SIGTERM),sender_pid:745894,sender_uid:0,reason:0]. Exiting ...
      20250610:160815.365 Zabbix Server stopped.
      20250610:160815.404 Starting Zabbix Server.

      Expected:

      Enabling HA on a single node should not cause these issues: the history cache should empty quickly after a restart, and value processing should continue as it does when the server is started without HA.
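
The log timestamps quoted for each restart can be turned into shutdown durations, which makes the restarts easier to compare. The sketch below is only an illustration: it assumes the log prefix has the form `YYYYMMDD:HHMMSS.mmm`, and the names `parse_ts`, `elapsed_seconds` and `RESTARTS` are hypothetical. The second restart is left out because no "Zabbix Server stopped." line was quoted for it.

```python
from datetime import datetime

# Assumed layout of the timestamp prefix in the quoted log lines.
LOG_TS_FORMAT = "%Y%m%d:%H%M%S.%f"


def parse_ts(stamp):
    """Parse a timestamp such as '20250610:121911.986'."""
    return datetime.strptime(stamp, LOG_TS_FORMAT)


def elapsed_seconds(start, end):
    """Seconds between two log timestamps."""
    return (parse_ts(end) - parse_ts(start)).total_seconds()


# (SIGTERM received, "Zabbix Server stopped.") pairs copied from this report.
RESTARTS = {
    "first": ("20250610:121911.986", "20250610:121926.055"),
    "third": ("20250610:143652.439", "20250610:144432.535"),
    "fourth": ("20250610:153343.619", "20250610:160815.365"),
}

for name, (sigterm, stopped) in RESTARTS.items():
    print(f"{name} restart: shutdown took {elapsed_seconds(sigterm, stopped):.3f} s")
```

With the timestamps above, the interval between SIGTERM and "Zabbix Server stopped." is roughly 14 seconds for the first restart, 7 minutes 40 seconds for the third, and 34 minutes 32 seconds for the fourth.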

        1. 1_caches.png (69 kB)
        2. 1_queue.png (50 kB)
        3. 2_caches.png (61 kB)
        4. 2_queue.png (54 kB)
        5. 3_caches.png (61 kB)
        6. 3_queue.png (43 kB)
        7. 4_caches.png (57 kB)
        8. 4_queue.png (49 kB)

            Assignee:
            Zabbix Support Team
            Reporter:
            Göran Eriksson
            Votes:
            1
            Watchers:
            2
