Uploaded image for project: 'ZABBIX FEATURE REQUESTS'
  1. ZABBIX FEATURE REQUESTS
  2. ZBXNEXT-4545

Limit item processing errors in log

XMLWordPrintable

    • Icon: New Feature Request New Feature Request
    • Resolution: Unresolved
    • Icon: Critical Critical
    • None
    • 3.4.9
    • Server (S)
    • None

      We have a case whereby a template used for SMART monitoring covers both HDD and SSD hosts. The items in the template are of type dependant. In some cases, there are items that can't be evaluated, as the value cannot be found (for example Percentage Lifetime Used is only available on SSD's not HDD's.

      Each time the item is processed, an error is logged in the Zabbix Server log file, which dumps out the entire output, as follows:-

      12689:20180510:114743.525 error reason for "host1-example:uHDD["/dev/bus/0 -d sat+megaraid,0",202]" changed: Item preprocessing step #1 failed: cannot perform regular expression match on value "smartctl 6.4 2014-10-07 r4002 [x86_64-linux-3.16.0-5-amd64] (local build)
      Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
      
      === START OF READ SMART DATA SECTION ===
      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
      3 Spin_Up_Time 0x0027 182 180 021 Pre-fail Always - 7875
      4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 15
      5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
      7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
      9 Power_On_Hours 0x0032 055 055 000 Old_age Always - 33251
      10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
      11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
      12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 15
      192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 14
      193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 209
      194 Temperature_Celsius 0x0022 115 108 000 Old_age Always - 37
      196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
      197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
      198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
      199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
      200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0" of type "string": pattern does not match
      
      

      We are monitoring thousands of hosts with Zabbix, and are seeing the log files hit anything around 5G. Log rotate does take care of this eventually.

      Ideas:-

       * Limit the output of a failed item preprocessing to X characters

       * Have the ability to "mute" item errors, either per item or a global option.

       

      Thanks as always!

       

            Unassigned Unassigned
            duncan9562 Duncan Mountford
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated: