es-master fails: Liveness probe failed: dial tcp 172.17.0.4:9300: getsockopt: connection refused

Question

Closed this issue 7 years ago · 4 comments

Hi,

I'm having a problem launching the es-master deployment.

It keeps restarting and if I check the pod description the failure appears to be:
Liveness probe failed: dial tcp 172.17.0.4:9300: getsockopt: connection refused

After checking all the issues, open and closed, as well as the documentation, I tried the following:

- giving minikube more memory
- adding

      name: "NETWORK_HOST"
      value: "eth0:ipv4"

- adding

      name: "NETWORK_HOST"
      value: "eth0"

None of them seems to work, and the log does not help much.

Additional info about the environment: minikube cluster running on RHEL7

Answer 1 · 2018-03-19T16:15:52.000Z

I had the same problem. It seems that on a rather slow cluster the liveness probe fires too early, before the node is accepting connections.
Adding an initialDelaySeconds setting under the liveness probe in es-master.yaml and es-client.yaml helped me:

    livenessProbe:
      tcpSocket:
        port: transport
      initialDelaySeconds: 30

Answer 2 · 2018-03-21T14:13:57.000Z

@mbert thank you!

Answer 3 · 2018-04-01T16:06:41.000Z

Interesting, I had the same problem (trying this on GKE with a cluster of 3 n1-standard-2 nodes).

With initialDelaySeconds set to 30, 2 of the 3 masters worked, while the third restarted constantly and inevitably entered a CrashLoopBackOff cycle. (I also set NETWORK_HOST.)

Setting the delay to 60 seconds seemed to solve the problem. I think there's some sort of master discovery algorithm that needs a bit of time to bootstrap.
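
For reference, a minimal sketch of what that change would look like, assuming the same tcpSocket probe on the transport port shown in Answer 1, with only initialDelaySeconds changed. The value 60 is simply what worked on this cluster; slower or faster clusters may need a different delay.

```yaml
livenessProbe:
  tcpSocket:
    port: transport          # named container port from Answer 1 (presumably 9300, per the error message)
  initialDelaySeconds: 60    # wait 60 s after the container starts before the first probe
```

This would go in the same place as the snippet in Answer 1, i.e. es-master.yaml (and es-client.yaml, if the client pods show the same restarts).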

Answer 4 · 2018-08-07T09:25:42.000Z

@tarr11 thx! It works