Old container's ip was not removed from ingress network after execute `docker serivce update` sometimes

liusf12 · September 26, 2024, 3:34am

I’m using docker swarm and docker service to manage my app.

Recently I updated a service with 2 tasks, and after about half an hour later, there is a 50% chance that the service is inaccessible through service name and vip.

I executed nslookup tasks.myservice, and found 4 ip listed, 2 are current running container’s ip, the other 2 are the stopped containers’ ip.

I executed the same command nslookup tasks.myservice on the two running containers, the result is a little different, only 3 ip found, the stopped container’s ip on the same host was not in the list.
It seems that, while the task stopped, the ip is removed from the host, but it failed to synchronize the information to swarm cluster.

And I found a message like msg="rmServiceBinding d0283c4f3f93e348e91b5239d30d1af0921e0f606ced2f057e85d8997ec7e8c9 possible transient state ok:false entries:0 set:false " from dockerd log.

My temporary fix is, make the node leave the swarm cluster and join it again.Is there any suggestion to investigate the root cause or how to avoid this issue? Thanks in advance.

My system info

uname -a
Linux dongni-nginx2 5.10.134-15.an8.x86_64 #1 SMP Thu Jul 20 00:35:47 CST 2023 x86_64 x86_64 x86_64 GNU/Linux

docker info 
Client: Docker Engine - Community
 Version:    24.0.7
 Context:    default

bluepuma77 · September 26, 2024, 8:53am

How many nodes are you using, how many are managers, are they geographically separated, are you using VLAN/VPN?

liusf12 · September 26, 2024, 2:30pm

There are abount 45 node in swarm cluster, nodes’ quantity is not fixed because some services use elastic scaling, 5 manager nodes in cluster. We are using aliyun cloud service, all the nodes in the same region, but not in the same rack, No VLAN/VPN used.

Topic		Replies	Views
IPs stuck when resolving service using docker network Swarm docker , swarm	0	654	April 12, 2021
Docker swarm load balance container ip clashes with service container ip Swarm	0	1145	December 20, 2019
DNS resolves wrong IP for deployed service Swarm dns	0	2344	April 24, 2018
Docker swarm networking:attempts to use same ip in mutiple containers General	0	1173	July 12, 2016
Docker Swarm service VIP change condition General swarm	1	2101	September 10, 2018

Old container's ip was not removed from ingress network after execute `docker serivce update` sometimes

Related topics