[ZBX-20027] [template][Cisco UCS SNMP] wrong trigger expressions for Temperature monitoring Created: 2021 Sep 29 Updated: 2022 Mar 09 |
Status: | Open |
Project: | ZABBIX BUGS AND ISSUES |
Component/s: | Templates (T) |
Affects Version/s: | 5.4.4 |
Fix Version/s: | None |
Type: | Incident report | Priority: | Trivial |
Reporter: | thomas | Assignee: | Aleksandre Sebiskveradze |
Resolution: | Unresolved | Votes: | 2 |
Labels: | templates | ||
Remaining Estimate: | Not Specified | ||
Time Spent: | Not Specified | ||
Original Estimate: | Not Specified | ||
Environment: |
Zabbix installation "All In One" VM |
Description |
The trigger prototype expressions for the discovery rules "Temperature CPU Discovery" and "Temperature Discovery", together with the host macros, should be reviewed in the default "Cisco UCS SNMP" template.
In the discovery rule "Temperature CPU Discovery", the macros below are used but not defined in the host macros: {$TEMP_WARN:"CPU"}
The discovery rule "Temperature Discovery" has two issues. First, the same macros are used regardless of sensor location: {$TEMP_WARN:"Ambient"} => 30°C. I do not think these values are appropriate for monitoring IOH or server rear panel temperature. As a result, false positive problems are generated when a UCS server is monitored with the default "Cisco UCS SNMP" template. Second, {$TEMP_CRIT_LOW:"Ambient"} is used in the trigger prototype expressions but not defined in the host macros.
MariaDB [zabbix]> SELECT m.macro, m.value FROM hosts h, hostmacro m WHERE h.hostid=m.hostid AND h.name="Cisco UCS SNMP" AND m.macro LIKE "%$TEMP%";
+------------------------+-------+
| macro                  | value |
+------------------------+-------+
| {$TEMP_CRIT:"Ambient"} | 35    |
| {$TEMP_CRIT_LOW}       | 5     |
| {$TEMP_CRIT}           | 60    |
| {$TEMP_WARN:"Ambient"} | 30    |
| {$TEMP_WARN}           | 50    |
+------------------------+-------+
5 rows in set (0.001 sec)

MariaDB [zabbix]> SELECT DISTINCT t.description,t.expression FROM triggers t,functions f,items i, hosts h WHERE h.hostid=i.hostid AND f.itemid=i.itemid AND f.triggerid=t.triggerid AND h.name="Cisco UCS SNMP" AND t.expression LIKE "%$TEMP%";
+----------------------------------------------------------------------------------------------+------------------------------------+
| description                                                                                  | expression                         |
+----------------------------------------------------------------------------------------------+------------------------------------+
| {#SENSOR_LOCATION}.IOH: Temperature is above warning threshold: >{$TEMP_WARN:"Ambient"}      | {19131}>{$TEMP_WARN:"Ambient"}     |
| {#SENSOR_LOCATION}.IOH: Temperature is above critical threshold: >{$TEMP_CRIT:"Ambient"}     | {19133}>{$TEMP_CRIT:"Ambient"}     |
| {#SENSOR_LOCATION}.IOH: Temperature is too low: <{$TEMP_CRIT_LOW:"Ambient"}                  | {19135}<{$TEMP_CRIT_LOW:"Ambient"} |
| {#SENSOR_LOCATION}.Ambient: Temperature is above warning threshold: >{$TEMP_WARN:"Ambient"}  | {19113}>{$TEMP_WARN:"Ambient"}     |
| {#SENSOR_LOCATION}.Ambient: Temperature is above critical threshold: >{$TEMP_CRIT:"Ambient"} | {19115}>{$TEMP_CRIT:"Ambient"}     |
| {#SENSOR_LOCATION}.Ambient: Temperature is too low: <{$TEMP_CRIT_LOW:"Ambient"}              | {19117}<{$TEMP_CRIT_LOW:"Ambient"} |
| {#SENSOR_LOCATION}.Front: Temperature is above warning threshold: >{$TEMP_WARN:"Ambient"}    | {19119}>{$TEMP_WARN:"Ambient"}     |
| {#SENSOR_LOCATION}.Front: Temperature is above critical threshold: >{$TEMP_CRIT:"Ambient"}   | {19121}>{$TEMP_CRIT:"Ambient"}     |
| {#SENSOR_LOCATION}.Front: Temperature is too low: <{$TEMP_CRIT_LOW:"Ambient"}                | {19123}<{$TEMP_CRIT_LOW:"Ambient"} |
| {#SENSOR_LOCATION}.Rear: Temperature is above warning threshold: >{$TEMP_WARN:"Ambient"}     | {19125}>{$TEMP_WARN:"Ambient"}     |
| {#SENSOR_LOCATION}.Rear: Temperature is above critical threshold: >{$TEMP_CRIT:"Ambient"}    | {19127}>{$TEMP_CRIT:"Ambient"}     |
| {#SENSOR_LOCATION}.Rear: Temperature is too low: <{$TEMP_CRIT_LOW:"Ambient"}                 | {19129}<{$TEMP_CRIT_LOW:"Ambient"} |
| {#SENSOR_LOCATION}: Temperature is above warning threshold: >{$TEMP_WARN:"CPU"}              | {19137}>{$TEMP_WARN:"CPU"}         |
| {#SENSOR_LOCATION}: Temperature is above critical threshold: >{$TEMP_CRIT:"CPU"}             | {19139}>{$TEMP_CRIT:"CPU"}         |
| {#SENSOR_LOCATION}: Temperature is too low: <{$TEMP_CRIT_LOW:"CPU"}                          | {19141}<{$TEMP_CRIT_LOW:"CPU"}     |
+----------------------------------------------------------------------------------------------+------------------------------------+
15 rows in set (0.002 sec)
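To make the mismatch easier to see, the two query results can be cross-checked with a short script. This is only a rough sketch: the macro names and values are copied by hand from the output above, and the helper names (base_macro, undefined_macros) are made up for illustration. It also assumes, as far as I understand the Zabbix user macro documentation, that a macro with context falls back to the macro without context when the context-specific one is not defined.

```python
import re

# Host macros defined on the template, copied from the first query result.
defined = {
    '{$TEMP_CRIT:"Ambient"}': 35,
    '{$TEMP_CRIT_LOW}': 5,
    '{$TEMP_CRIT}': 60,
    '{$TEMP_WARN:"Ambient"}': 30,
    '{$TEMP_WARN}': 50,
}

# Distinct macros referenced by the trigger prototypes (second query result).
used = [
    '{$TEMP_WARN:"Ambient"}',
    '{$TEMP_CRIT:"Ambient"}',
    '{$TEMP_CRIT_LOW:"Ambient"}',
    '{$TEMP_WARN:"CPU"}',
    '{$TEMP_CRIT:"CPU"}',
    '{$TEMP_CRIT_LOW:"CPU"}',
]

# Matches a user macro with a quoted context, e.g. {$TEMP_WARN:"CPU"}.
CONTEXT_RE = re.compile(r'^\{\$([A-Z0-9_.]+):"[^"]*"\}$')


def base_macro(macro):
    """Return the macro without its context (hypothetical helper)."""
    match = CONTEXT_RE.match(macro)
    return "{$" + match.group(1) + "}" if match else macro


def undefined_macros(used, defined):
    """List used macros with no explicit definition, plus the assumed fallback."""
    rows = []
    for macro in used:
        if macro in defined:
            continue
        fallback = base_macro(macro)
        # defined.get() yields None if the base macro is missing as well.
        rows.append((macro, fallback, defined.get(fallback)))
    return rows


missing = undefined_macros(used, defined)
for macro, fallback, value in missing:
    print(f"{macro} is not defined; base macro {fallback} = {value}")
```

With the data above, the script reports {$TEMP_CRIT_LOW:"Ambient"} and the three "CPU" macros as having no explicit definition, which matches the two issues described in this report.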