Docker swarm heartbeat to manager failed

zhenyu2048 · September 25, 2019, 12:38am

I observed the following behavior:
I ran docker swarm and a docker registry on one VM. When running docker push to publish a big docker image to the docker registry, sometimes I found all docker swarm service restarted, and I saw the following errors in docker log:
*level=error msg=“heartbeat to manager {**} failed” error=“rpc error: code = InvalidArgument desc = session invalid”**method="(session).heartbeat" module=node/agent

After some research looks like the dispatcher-heartbeat parameter of docker swarm is related, and the issue is gone after I increase it from the default 5 sec to 10 sec.

I suspect during the docker push of the big docker image, the network is very busy so the swarm heartbeat packet is lost or delayed. And even though there is only one node in the swarm cluster, there is still heartbeat (a loopback from the dockerd to itself?).

Can somebody help to confirm my understanding?

Thanks.

Topic		Replies	Views
Containers rebooting because "heartbeat to manager failed" Swarm	3	10243	July 3, 2019
Docker swarm rebuilds all containers at different times Swarm swarm	16	3603	May 8, 2025
All docker service in docker swarm suddenly restarted General docker , swarm	1	3383	September 26, 2022
Swarm manager loses state in one-node swarm General docker , swarm	13	6011	March 18, 2024
Docker Swarm: bulk sync to node failed, General docker , swarm	0	4594	March 31, 2020

Docker swarm heartbeat to manager failed

Related topics