I have a docker swarm, currently consisting of two nodes. It uses no special settings except for 192.168.0.0/16 address space since I use 10.0.0.0 in my home network.
The swarm runs fine, but every few hours a node freezes (I can ping it, but no SSH or anything else). I found the following errors in the log files:
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.413073908+01:00" level=info msg="Node 0976f5ad5134/10.0.1.0, joined gossip cluster"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.413134038+01:00" level=info msg="Node 0976f5ad5134/10.0.1.0, added to nodes list"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.479811612+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.479892612+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.479907649+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.479921612+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.479934778+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.479947501+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.480917408+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.480955649+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.480972038+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.480987223+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.480999741+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.481017834+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.508647853+01:00" level=info msg="API listen on /var/run/docker.sock"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.508793371+01:00" level=info msg="API listen on [::]:2375"
Sep 25 20:58:49 swarm1 systemd[1]: Started docker.service - Docker Application Container Engine.
░░ Subject: A start job for unit docker.service has finished successfully
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░
░░ A start job for unit docker.service has finished successfully.
░░
░░ The job identifier is 119.
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.561703945+01:00" level=info msg="initialized VXLAN UDP port to 4789 " module=node node.id=sz730ux3bc1nrfsbrf4msy6cw
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.607927556+01:00" level=error msg="set bridge default vlan failed" error="failed to enable default vlan on bridge br0: open /sys/class/net/br0/bridge/default_pvid: permission denied"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.731934501+01:00" level=info msg="initialized VXLAN UDP port to 4789 " module=node node.id=sz730ux3bc1nrfsbrf4msy6cw
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.986662686+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.986766371+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.986786574+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.986814908+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.986829315+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:49 swarm1 dockerd[862]: time="2024-09-25T20:58:49.986843241+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.139640426+01:00" level=error msg="set bridge default vlan failed" error="failed to enable default vlan on bridge br0: open /sys/class/net/br0/bridge/default_pvid: permission denied"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.298607019+01:00" level=info msg="Node f65fbe5e16eb/10.0.1.1, joined gossip cluster"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.298653426+01:00" level=info msg="Node f65fbe5e16eb/10.0.1.1, added to nodes list"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432251815+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432347963+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432370019+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432583371+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432620093+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432638778+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432707889+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432763445+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432779371+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432802296+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_nodest_conn" error="open /proc/sys/net/ipv4/vs/expire_nodest_conn: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432816648+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.expire_quiescent_template" error="open /proc/sys/net/ipv4/vs/expire_quiescent_template: no such file or directory"
Sep 25 20:58:50 swarm1 dockerd[862]: time="2024-09-25T20:58:50.432830148+01:00" level=error msg="error reading the kernel parameter net.ipv4.vs.conn_reuse_mode" error="open /proc/sys/net/ipv4/vs/conn_reuse_mode: no such file or directory"
Can anybody tell me what those errors mean? They seem to be ip related, but I changed the address space which my docker networks are all abiding by