-
Incident report
-
Resolution: Fixed
-
Major
-
None
-
1.4.2
-
None
-
Linux, all flavors
In the log below, there is a 16 minute gap where the agent shows that it doesn't do anything. The first thing after the agent continues is that it says the following error:
13962:20071108:094205 Send value error: ZBX_TCP_READ() failed [Connection timed out]
After this, it works fine again for awhile and it eventually happens again.
This happens on a few of our servers intermittently. We only use active checks.
Is there a solution to this?
13962:20071108:092607 Parsed [vm.memory.size[buffers]:600:0]
13962:20071108:092607 In add_check('vm.memory.size[buffers]', 600, 0)
13962:20071108:092607 Parsed [vm.memory.size[cached]:600:0]
13962:20071108:092607 In add_check('vm.memory.size[cached]', 600, 0)
13962:20071108:092607 Parsed [vm.memory.size[free]:450:0]
13962:20071108:092607 In add_check('vm.memory.size[free]', 450, 0)
13962:20071108:092607 Parsed [vm.memory.size[shared]:600:0]
13962:20071108:092607 In add_check('vm.memory.size[shared]', 600, 0)
13962:20071108:092607 Parsed [vm.memory.size[total]:86400:0]
13962:20071108:092607 In add_check('vm.memory.size[total]', 86400, 0)
13962:20071108:092607 Parsed [ZBX_EOF]
13962:20071108:092607 In process_active_checks('server.domain.com',10051)
13962:20071108:092607 For key [agent.ping] received value [1]
13962:20071108:092607 XML before sending [<req><host>c3RyZWFtLnNwaW1haWw=</host><key>YWdlbnQucGl
uZw==</key><data>MQ==</data></req>]
13962:20071108:092607 OK
13962:20071108:092607 In get_min_nextcheck()
13962:20071108:092607 Sleeping for 13 seconds
13962:20071108:092620 In process_active_checks('server.domain.com',10051)
13962:20071108:092620 For key [system.cpu.load] received value [0.160000]
13962:20071108:092620 XML before sending [<req><host>c3RyZWFtLnNwaW1haWw=</host><key>c3lzdGVtLmN
wdS5sb2Fk</key><data>MC4xNjAwMDA=</data></req>]
13962:20071108:094205 Send value error: ZBX_TCP_READ() failed [Connection timed out]
13962:20071108:094205 In get_min_nextcheck()
13962:20071108:094205 No sleeping
13962:20071108:094205 In refresh_metrics('server.domain.com',10051)
13962:20071108:094205 get_active_checks('server.domain.com',10051)
13962:20071108:094208 Sending [ZBX_GET_ACTIVE_CHECKS
stream.server
]
13962:20071108:094208 Before read
13962:20071108:094208 In parse_list_of_checks('agent.ping:30:0
agent.version:86400:0