  1. ZABBIX BUGS AND ISSUES
  2. ZBX-23356

100% cache-hit, losing events


    • Type: Problem report
    • Resolution: Commercial support required
    • Priority: Trivial
    • None
    • 6.0.20
    • None
    • None
    • We have a single Zabbix server installation with two proxies. The server runs on a VMware VM and uses a PostgreSQL database with TimescaleDB hosted on a separate system. The database is around 650 GB. The system monitors 2700 hosts, 144k items, and 81k triggers.

      CPU usage on the server is low, and more than 50% of its memory is available.

      The problem is that after we start the server, it takes about 3 hours for the history write cache to reach 100% usage, after which poller utilization also goes to 100%.

      From that point on, the system does not update any graphs.

      The PostgreSQL database shows low CPU and memory usage, and we did not find anything out of the ordinary. Its file system sits on an enterprise NVMe disk array.

      When we stop the Zabbix server, the process takes roughly 6 hours to exit because it first has to write its cached data to the database.

      We need to find out what is wrong with the setup.
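
The two timings in this report (about 3 hours for the history write cache to fill, about 6 hours to flush it on shutdown) can be related with some simple backlog arithmetic. The Python sketch below is only a rough model, not a description of how Zabbix works internally. It assumes constant incoming and write rates, a cache that is completely full at shutdown, and no new values arriving while it drains. The function names and the cache capacity in the usage example are hypothetical and were not measured on this installation.

```python
def hours_to_fill(capacity_values, incoming_per_s, written_per_s):
    """Hours until a buffer holding capacity_values is full at constant rates."""
    net = incoming_per_s - written_per_s
    if net <= 0:
        return float("inf")  # writes keep up, so the buffer never fills
    return capacity_values / net / 3600.0


def hours_to_drain(backlog_values, written_per_s):
    """Hours needed to flush backlog_values when no new values arrive."""
    if written_per_s <= 0:
        raise ValueError("write rate must be positive")
    return backlog_values / written_per_s / 3600.0


def implied_rates(capacity_values, fill_hours, drain_hours):
    """Invert the two formulas above: (incoming, written) values per second."""
    written = capacity_values / (drain_hours * 3600.0)
    incoming = written + capacity_values / (fill_hours * 3600.0)
    return incoming, written


if __name__ == "__main__":
    # Hypothetical capacity; the ratio below does not depend on it.
    incoming, written = implied_rates(1_000_000, fill_hours=3, drain_hours=6)
    print(f"incoming/written ratio: {incoming / written:.2f}")
```

Under these assumptions the ratio of incoming to written values is 1 + drain_hours / fill_hours, so the reported 3 and 6 hours would correspond to values arriving roughly three times faster than they are written. If the real rates vary over time or the cache was not full at shutdown, the figure will differ.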

        1. cache_hit.png (24 kB)
        2. cpu.png (90 kB)
        3. data_gathering.png (265 kB)
        4. data_handling.png (143 kB)
        5. Internal_process.png (253 kB)
        6. poller.png (101 kB)
        7. queue_depth.png (40 kB)
        8. utilization.png (168 kB)

            zabbix.support Zabbix Support Team
            kzourkas Kostas Zourkas
