Uploaded image for project: 'ZABBIX BUGS AND ISSUES'
  1. ZABBIX BUGS AND ISSUES
  2. ZBX-7936

wonky connection to agent due to "Interrupted system call" (cloned - reopened)

    Details

    • Type: Incident report
    • Status: Confirmed
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
    • Environment:
      Fedora 17 x86_64, selinux in permissive mode, kernel 3.5.2-1.fc17.x86_64

      Description

      Server log file is full of following:

       30907:20120821:225740.503 Zabbix agent item [net.tcp.service[smtp]] on host [localhost] failed: another network error, wait for 15 seconds
       30907:20120821:225755.505 resuming Zabbix agent checks on host [localhost]: connection restored
       30904:20120821:225822.907 Zabbix agent item [net.tcp.service[smtp]] on host [localhost] failed: first network error, wait for 15 seconds
       30907:20120821:225840.602 Zabbix agent item [net.tcp.service[smtp]] on host [localhost] failed: another network error, wait for 15 seconds
       30907:20120821:225855.605 resuming Zabbix agent checks on host [localhost]: connection restored
      

      There are problems collecting items (there are random gaps in graphs).

      There's nothing relevant in agent log.
      smtp server is running:

      $ sudo -u zabbix telnet localhost smtp
      Trying 127.0.0.1...
      Connected to localhost.
      Escape character is '^]'.
      

      Selinux is in permissive mode.

      Running in debug more adds more information:

      Server log:

       30679:20120821:225421.743 sleeping for 3 seconds
       30702:20120821:225421.947 In collect_selfmon_stats()
       30702:20120821:225421.948 End of collect_selfmon_stats()
       30702:20120821:225421.948 sleeping for 1 seconds
       30682:20120821:225422.239 Item [localhost:net.tcp.service[smtp]] error: Get value from agent failed: ZBX_TCP_READ() failed: [4] Interrupted system call
       30682:20120821:225422.239 In zabbix_log()
       30682:20120821:225422.239 In DCconfig_get_items() hostid:0 key:'zabbix[log]'
       30682:20120821:225422.239 End of DCconfig_get_items():0
       30682:20120821:225422.239 End of zabbix_log()
       30682:20120821:225422.239 End of get_value():NETWORK_ERROR
       30682:20120821:225422.239 In deactivate_host() hostid:10017 itemid:79 type:0
       30682:20120821:225422.239 deactivate_host() errors_from:0 available:1
       30682:20120821:225422.239 query [txnlev:1] [begin;]
       30682:20120821:225422.239 query [txnlev:1] [update hosts set errors_from=1345586062,disable_until=1345586077 where hostid=10017]
       30682:20120821:225422.240 query [txnlev:1] [commit;]
       30682:20120821:225422.306 Zabbix agent item [net.tcp.service[smtp]] on host [localhost] failed: first network error, wait for 15 seconds
       30682:20120821:225422.306 In zabbix_log()
       30682:20120821:225422.306 In DCconfig_get_items() hostid:0 key:'zabbix[log]'
       30682:20120821:225422.306 End of DCconfig_get_items():0
       30682:20120821:225422.306 End of zabbix_log()
       30682:20120821:225422.306 End of deactivate_host()
       30682:20120821:225422.306 End of get_values()
       30682:20120821:225422.307 poller #4 spent 3.068150 seconds while updating 1 values
       30682:20120821:225422.307 In DCconfig_get_poller_nextcheck() poller_type:0
       30682:20120821:225422.307 End of DCconfig_get_poller_nextcheck():1345586064
       30682:20120821:225422.307 sleeping for 2 seconds
      

      agent log:

      29584:20120821:225421.275 Processing request.
       29584:20120821:225421.275 Requested [proc.num[,,run]]
       29584:20120821:225421.295 Sending back [1]
       29585:20120821:225421.615 In send_buffer() host:'127.0.0.1' port:10051 values:0/100
       29585:20120821:225421.615 End of send_buffer():SUCCEED
       29585:20120821:225421.615 Sleeping for 1 second(s)
       29581:20120821:225421.713 In update_cpustats()
       29581:20120821:225421.713 End of update_cpustats()
       29583:20120821:225422.239 TCP expect network error: ZBX_TCP_READ() failed: [4] Interrupted system call
       29583:20120821:225422.239 Sending back [0]
       29583:20120821:225422.239 Got signal [signal:13(SIGPIPE),sender_pid:29583]. Ignoring ...
       29583:20120821:225422.239 Process listener error: ZBX_TCP_WRITE() failed: [32] Broken pipe
       29585:20120821:225422.615 In send_buffer() host:'127.0.0.1' port:10051 values:0/100
       29585:20120821:225422.615 End of send_buffer():SUCCEED
       29585:20120821:225422.615 Sleeping for 1 second(s)
       29581:20120821:225422.713 In update_cpustats()
       29581:20120821:225422.713 End of update_cpustats()
       29585:20120821:225423.615 In send_buffer() host:'127.0.0.1' port:10051 values:0/100
       29585:20120821:225423.615 End of send_buffer():SUCCEED
       29585:20120821:225423.615 Sleeping for 1 second(s)
       29581:20120821:225423.714 In update_cpustats()
       29581:20120821:225423.714 End of update_cpustats()
       29585:20120821:225424.615 In send_buffer() host:'127.0.0.1' port:10051 values:0/100
       29585:20120821:225424.615 End of send_buffer():SUCCEED
      

        Attachments

        1. mysqlperf.png
          mysqlperf.png
          75 kB
        2. sysperf.png
          sysperf.png
          106 kB
        3. ZBX-7936.tar.gz
          42 kB

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              bagni Lorenzo Bagni
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: