Get value from agent failed: ZBX_TCP_READ() timed out
dmtx97 opened this issue · 1 comment
First of all, I love the template! Occasionally I receive the error in the title on my backup server; the associated log is below. After a few minutes the connection is restored and everything works again. The scripts run fine when I test them locally, so the problem only appears when Zabbix runs them. I am not sure whether this is a problem with my network or whether the script execution itself is causing the timeout. Thank you.
17415:20190411:085014.467 Zabbix agent item "vbr[ResultBackup,1686a71c-dd85-414b-930e-539a87cd91d5]" on host "pcorpbkp01.adc.dbu.edu" failed: another network error, wait for 15 seconds
17422:20190411:085032.735 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
17415:20190411:085039.478 temporarily disabling Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": host unavailable
17450:20190411:085115.440 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
17448:20190411:085136.468 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
17416:20190411:085141.219 enabling Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": host became available
17470:20190411:085231.879 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
17424:20190411:085315.425 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
17490:20190411:085336.125 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
17486:20190411:085432.017 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
17367:20190411:085438.005 Zabbix agent item "vbr[VmCountResultBackup,Programing Workstation,Failed]" on host "pcorpbkp01.adc.dbu.edu" failed: first network error, wait for 15 seconds
17415:20190411:085457.817 resuming Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": connection restored
17476:20190411:085515.371 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
17417:20190411:085536.812 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
17486:20190411:085632.156 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
17435:20190411:085715.285 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
17489:20190411:085736.454 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
17448:20190411:085832.298 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
17500:20190411:085915.281 cannot send list of active checks to "172.17.1.160": host [.adc.dbu.edu] not found
17427:20190411:085936.090 cannot send list of active checks to "172.17.2.18": host [.adc.dbu.edu] not found
17416:20190411:085949.936 Zabbix agent item "vbr[ResultBackup,e3cfcccb-62e2-4a6c-a547-003c52657432]" on host "pcorpbkp01.adc.dbu.edu" failed: first network error, wait for 15 seconds
17415:20190411:090014.981 Zabbix agent item "vbr[ResultBackup,1686a71c-dd85-414b-930e-539a87cd91d5]" on host "pcorpbkp01.adc.dbu.edu" failed: another network error, wait for 15 seconds
17504:20190411:090032.442 cannot send list of active checks to "172.17.5.3": host [pcorpsqlaa.adc.dbu.edu] not found
17416:20190411:090039.016 temporarily disabling Zabbix agent checks on host "pcorpbkp01.adc.dbu.edu": host unavailable
Hello,
I suspect this is more likely a lack of resources on the Zabbix server or the agent.
On the Zabbix server, check whether the "Zabbix busy poller processes" and "Zabbix busy unreachable poller processes" items are peaking at 100%.
If they are, you will need to add poller instances in zabbix_server.conf (the StartPollers and StartPollersUnreachable parameters).
The number of pre-forked zabbix_agentd instances can also be increased (e.g. StartAgents=10).
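As a rough sketch, the two changes above would look something like the fragment below. The poller values are placeholders I picked for illustration, not tuned recommendations; choose them based on the busy percentages you actually see. Both services need a restart for the new values to take effect.

```ini
# zabbix_server.conf (on the Zabbix server)
# Illustrative values only: raise them step by step and watch the
# "busy poller" graphs after each change.
StartPollers=10
StartPollersUnreachable=5

# zabbix_agentd.conf (on the monitored host, here the backup server)
# Value taken from the suggestion above.
StartAgents=10
```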
Also check that the pcorpsqlaa.adc.dbu.edu server remains reachable. This can be tested with the Zabbix ICMP Ping template.
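To see how often each kind of message shows up in the server log, a small script along these lines may help. It is only a sketch: the category labels and function names are made up for illustration, the substrings are taken from the log excerpt in this issue, and the log path is whatever you pass on the command line.

```python
import re
import sys
from collections import Counter

# Log lines in the excerpt look like "PID:YYYYMMDD:HHMMSS.mmm message".
LINE_RE = re.compile(r"^\s*\d+:\d{8}:\d{6}\.\d{3}\s+(.*)$")

# Substrings copied from the log excerpt, mapped to hypothetical labels.
CATEGORIES = [
    ("first network error", "first_network_error"),
    ("another network error", "another_network_error"),
    ("temporarily disabling", "checks_disabled"),
    ("enabling Zabbix agent checks", "checks_enabled"),
    ("resuming Zabbix agent checks", "checks_resumed"),
    ("cannot send list of active checks", "active_checks_host_not_found"),
]


def classify(message):
    """Return the label of the first matching substring, else "other"."""
    for needle, label in CATEGORIES:
        if needle in message:
            return label
    return "other"


def summarize(lines):
    """Count log messages per category, skipping lines that do not parse."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            counts[classify(match.group(1))] += 1
    return counts


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python summarize_log.py /path/to/zabbix_server.log
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        for label, count in summarize(handle).most_common():
            print(f"{count:6d}  {label}")
```

If the network-error counts climb at the same times the poller graphs are saturated, that would support the resource explanation; if they do not, the network is the next thing to look at.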