[ZBXNEXT-8700] Add TBW support (for SATA SSDs) to Zabbix SMART template Created: 2023 Sep 12 Updated: 2024 Jun 12 |
|
Status: | Open |
Project: | ZABBIX FEATURE REQUESTS |
Component/s: | Agent2 plugin (G) |
Affects Version/s: | None |
Fix Version/s: | None |
Type: | New Feature Request | Priority: | High |
Reporter: | Dan MacDonald | Assignee: | Aleksandre Sebiskveradze |
Resolution: | Unresolved | Votes: | 1 |
Labels: | templates | ||
Remaining Estimate: | Not Specified | ||
Time Spent: | Not Specified | ||
Original Estimate: | Not Specified |
Description |
Recently I have come to realise that a SATA SSD passing a long or short smartctl test on a SATA SSD doesn't mean that the drives performance hasn't degraded. It would seem that the correct way to test the health of a SATA SSD under Linux is to use one of these TBW (Total Bytes Written) scripts / commands which are based upon smartctl: https://askubuntu.com/questions/865792/how-can-i-monitor-the-tbw-on-my-samsung-ssd The current SMART template: https://www.zabbix.com/integrations/smart Has a percentage used item of NVMe disks but it seems to lack an equivalent for SATA disks which would be TBW. I would like to see the Zabbix SMART template upgraded to support TBW for at least Samsung SSDs. I think it should alert by default when the TBW drops below 90%. I have a couple of disks with a TBW of 87% and it has brought the speed of the array (pool) down drastically. |
Comments |
Comment by dimir [ 2023 Sep 21 ] |
Coming from https://www.zabbix.com/forum/zabbix-help/470638-monitoring-tbw-total-bytes-written-for-samsung-sata-ssds |
Comment by Dan MacDonald [ 2023 Sep 29 ] |
Actually, there's no need for that TBW script. For Samsung SATA SSDs you can get the wear levelling count from smartctl: smartctl -a /dev/sda | grep Wear | awk '{print $4}' |
Comment by Dan MacDonald [ 2024 Feb 29 ] |
I have found a Zabbix template that works for monitoring the wear levelling of Samsung SATA SSDs, even when they're connected to a RAID controller running in HBA/IT mode as is the case for me:
https://github.com/vargaloid/zabbix_template_samsung.ssd.smart
I have tested it with Zabbix 6.0, Ubuntu 22.04 and a Dell PowerEdge RAID controller of some sort. |
Comment by Dan MacDonald [ 2024 Feb 29 ] |
I have opened a ticket for it though because I think they need to adjust their default wear levelling trigger point by almost 90%
https://github.com/vargaloid/zabbix_template_samsung.ssd.smart/issues/2 |