[ZBX-12990] VMware Datastore usage mismatch Created: 2017 Nov 06  Updated: 2024 Apr 10  Resolved: 2018 Jun 05

Status: Closed
Project: ZABBIX BUGS AND ISSUES
Component/s: Proxy (P), Server (S)
Affects Version/s: 3.2.7
Fix Version/s: 3.0.19rc1, 3.4.11rc1, 4.0.0alpha8, 4.0 (plan)

Type: Incident report Priority: Major
Reporter: Érick Rangel Gomes Assignee: Michael Veksler
Resolution: Fixed Votes: 0
Labels: vmware
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

Debian 3.16.0-4-amd64, mysql Ver 15.1 Distrib 10.0.30-MariaDB, for debian-linux-gnu (x86_64) using readline 5.2, 2 VCPU, 4GB RAM, VMware ESXi 5.5.0 build-2068190.


Attachments: PNG File VMMare monitoring NEW.png     PNG File VMMare monitoring.png     PNG File Zabbix monitoring NEW.PNG     PNG File Zabbix monitoring.PNG     PNG File collecting metrics for that datastore.PNG    
Issue Links:
Causes
causes ZBX-15142 LC_ALL=en_US.UTF-8 locale on server c... Closed
causes ZBX-14677 wmvare monitoring (datastore,NIC,perf... Closed
causes ZBX-14548 cannot get refresh rate for Datastore... Closed
caused by ZBX-15417 VMWare datastore monitoring loses *al... Closed
Team: Team C
Sprint: Sprint 33, Sprint 34, Sprint 35
Story Points: 1

 Description   

Steps to reproduce:
On a VMware host, create a VM snapshot.

Result:
The used space reported by Zabbix did not change when the snapshot created its files on the datastore.

Expected:
The Zabbix monitoring (attached image) must follow the VMware monitoring (attached image).



 Comments   
Comment by Edmunds Vesmanis [ 2017 Nov 06 ]

Hi Érick!

Do I understand correctly from the provided graph:
Snapshot was taken about 11:45 AM and then it grew by more than 30 GB till 7:00 AM when it reached the capacity of that datastore?

Could you post the key and a screenshot of the item collecting metrics for that datastore?

Best regards,
Edmunds

Comment by Edmunds Vesmanis [ 2017 Nov 06 ]

The issue is on the VMware side:
https://kb.vmware.com/s/article/2110199

I can confirm that while data is being written to the storage, those counters are not updated. You only get correct data once the writing process stops.

Comment by Érick Rangel Gomes [ 2017 Nov 06 ]

The referenced KB has been applied on the ESXi host.
About the previous questions:

"...Snapshot was taken about 11:45 AM and then it grew by more than 30 GB till 7:00 AM when it reached the capacity of that datastore?"

Answer: Yes, the capacity was reached about 07:00 AM.

"Could you post the key and a screenshot of the item collecting metrics for that datastore."

Answer: The item used is:

vmware.hv.datastore.size[{$URL},{HOST.HOST},8_ALM_Primary_DB_Datafiles,free]

The requested screenshot has been attached.

We are now running another test and the issue is happening again. On another datastore, we transferred a file directly to the datastore (not to a VM) and the space did not change in Zabbix.
I attached two new images (marked NEW) that show the value mismatch. On ESXi the value is 950 GB (correct), but in Zabbix it is still 961 GB.

Comment by Edmunds Vesmanis [ 2017 Nov 09 ]

As I found out from the developers, Zabbix gets storage info from configuration data instead of performance counters.

Testing (ESXi 5.5, 6.0, 6.5) also shows that Zabbix won't get info on storage changes unless a refresh is triggered manually. Only on vSphere 6.x is the config updated every 30 minutes, so that storage usage changes are picked up.
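
To illustrate what reading "configuration data" can look like, here is a minimal Python sketch. It assumes the values in question correspond to the cached datastore summary properties that pyVmomi exposes on a vim.Datastore object (summary.capacity, summary.freeSpace, info.timestamp); the ticket does not confirm which properties Zabbix actually reads. The helper name datastore_snapshot and the example values are hypothetical.

```python
from types import SimpleNamespace

GIB = 1024 ** 3


def datastore_snapshot(ds):
    """Return the cached capacity figures of one datastore-like object.

    `ds` is expected to look like pyVmomi's vim.Datastore: it must expose
    summary.name, summary.capacity, summary.freeSpace and info.timestamp.
    These are cached values, so they can lag behind the real usage until
    vSphere refreshes them.
    """
    summary = ds.summary
    return {
        "name": summary.name,
        "capacity_gib": summary.capacity / GIB,
        "free_gib": summary.freeSpace / GIB,
        "used_gib": (summary.capacity - summary.freeSpace) / GIB,
        # When vSphere last updated the free-space value (may be None).
        "last_updated": ds.info.timestamp,
    }


# Stand-in object with made-up numbers, so the sketch runs without a
# vSphere connection. It is not data from this ticket.
example = SimpleNamespace(
    summary=SimpleNamespace(name="example_ds", capacity=100 * GIB, freeSpace=40 * GIB),
    info=SimpleNamespace(timestamp=None),
)
```

Comparing last_updated with the current time is one way to judge how stale the free-space figure may be.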

The workaround is to regularly trigger a storage rescan via the API or PowerCLI.
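
A minimal sketch of such a periodic refresh in Python with pyVmomi follows. It assumes that calling RefreshDatastoreStorageInfo() on each datastore is an acceptable form of the "storage rescan" mentioned above, which the ticket does not specify. The function names and the host, user and password parameters are hypothetical, and certificate verification is disabled only to keep the example short.

```python
import ssl


def refresh_datastores(datastores):
    """Ask vSphere to refresh the storage info of each datastore.

    Every object must expose RefreshDatastoreStorageInfo() and a name,
    as pyVmomi's vim.Datastore does. Returns the refreshed names.
    """
    refreshed = []
    for ds in datastores:
        ds.RefreshDatastoreStorageInfo()
        refreshed.append(ds.name)
    return refreshed


def list_datastores(content):
    """Collect all vim.Datastore objects below the inventory root."""
    from pyVmomi import vim  # third-party package: pyvmomi

    view = content.viewManager.CreateContainerView(
        content.rootFolder, [vim.Datastore], True
    )
    try:
        return list(view.view)
    finally:
        view.Destroy()


def refresh_once(host, user, password):
    """Connect, refresh every datastore once, then disconnect."""
    from pyVim.connect import SmartConnect, Disconnect

    # Assumption: a lab setup with a self-signed certificate. Use a
    # verifying SSL context in production.
    context = ssl.create_default_context()
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE

    si = SmartConnect(host=host, user=user, pwd=password, sslContext=context)
    try:
        return refresh_datastores(list_datastores(si.RetrieveContent()))
    finally:
        Disconnect(si)
```

Running refresh_once from cron or another scheduler, at an interval shorter than the Zabbix item update interval, would keep the values current under these assumptions.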

Comment by Oleksii Zagorskyi [ 2018 Nov 07 ]

It caused a regression - ZBX-15142

Generated at Fri Apr 19 09:19:46 EEST 2024 using Jira 9.12.4#9120004-sha1:625303b708afdb767e17cb2838290c41888e9ff0.