Autoscaling in Docker Swarm

Hi Team,

We have implemented Docker Swarm in our production environment.
Now we want to know whether autoscaling is possible in Docker Swarm.

If yes, how? Please assist.

For example: let's say that on some day, like Black Friday, we are getting more hits than on a usual day, and the single server on which the web server's Docker image is running is unable to handle that many requests. Is there any way, or any method provided by Docker, that can automatically spin up one more web server container on a spare server?

Please assist.

Hi orj123,
Docker Swarm services can be scaled with a command, but there is no automatic way to do that.

I'm using a combination of cAdvisor and node-exporter containers running on all Docker nodes to export metrics to a Prometheus instance. We also have a Grafana portal connected to Prometheus to get nice graphs, and Grafana can send alerts by mail or via a Telegram bot.

With that you will have all container (and node) metrics in Prometheus, and you can easily poll Prometheus with a simple curl (or whatever you want to use) and, depending on the values, launch a command to scale up or down. (We are using the VMware orchestrator for that.)
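In case it helps as a starting point, a stack file for that monitoring side could look roughly like this; the image tags, ports and mounts below are assumptions, so trim them to your setup:

    version: '3.7'
    services:
      cadvisor:
        image: google/cadvisor:latest
        deploy:
          mode: global              # one task per node, so every host reports its containers
        volumes:
          - /:/rootfs:ro
          - /var/run:/var/run:ro
          - /sys:/sys:ro
          - /var/lib/docker/:/var/lib/docker:ro
      node-exporter:
        image: prom/node-exporter:latest
        deploy:
          mode: global              # node-level metrics from every host
      prometheus:
        image: prom/prometheus:latest
        ports:
          - "9090:9090"
        # mount your prometheus.yml with the scrape configs here
      grafana:
        image: grafana/grafana:latest
        ports:
          - "3000:3000"

Deployed with docker stack deploy -c monitoring.yml monitoring, this gives Prometheus one cAdvisor and one node-exporter target per node.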

It is a somewhat complex setup, but it is possible and it works very well for us.

If you need more details please feel free to ask.

Regards


Thanks a lot for your suggestion, I will definitely work on it. Thanks again.

This was so helpful for me! A million thanks for this, you've made my life so much easier!

Hello Eldeberde,

Can you provide more detailed information?

We have Java microservices in Docker Swarm and we want to scale these services up and down based on the number of requests.

Thanks

Hi.

I'm using cAdvisor to collect all container metrics, deployed as a global service, so there is one replica on each host.

Prometheus is polling this cAdvisor service on each node, and after that you can poll Prometheus.
These are the very basic configuration lines to scrape cAdvisor from Prometheus:

    - job_name: 'cadvisor'
      dns_sd_configs:
        - names: ['tasks.cadvisor']
          type: A
          port: 8080

Prometheus is also a service, and we are using the internal Docker DNS resolver to poll the "cadvisor" service on its exposed port.


You can poll Prometheus with a simple curl or whatever you want. This query gives the total CPU consumed by a service (all replicas of the service across the nodes in the cluster):

    sum(rate(container_cpu_user_seconds_total{container_label_com_docker_swarm_service_name=~"SERVICE_NAME",id=~"/docker/.*"}[1m])*100)
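For example, that query can be sent to the Prometheus HTTP API and the numeric value extracted like this (the prometheus hostname and the use of jq are assumptions):

    curl -sG 'http://prometheus:9090/api/v1/query' \
      --data-urlencode 'query=sum(rate(container_cpu_user_seconds_total{container_label_com_docker_swarm_service_name=~"SERVICE_NAME",id=~"/docker/.*"}[1m])*100)' \
      | jq -r '.data.result[0].value[1]'    # the current value as a string, e.g. "137.4"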

Or you can create your own custom query if you need to scale based on memory or anything else.

After that, with this per-service CPU usage metric, you can decide whether you need to scale the service or not.

If you need to scale, you can get the number of running replicas:

    REPLICAS=$(docker service ps SERVICE_NAME | grep Running | wc -l)

And launch a command:

"# docker service update --replicas $REPLICAS + 1 "

You can also deploy Prometheus as a Docker service, and it shouldn't take much time to write a custom script that polls Prometheus and the Docker manager to get the data and take a decision.
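To make it concrete, a minimal sketch of such a script, built from the commands above, could look like this (the threshold, maximum replica count, service name and Prometheus address are assumptions):

    #!/bin/sh
    # Poll Prometheus for the per-service CPU usage and add one replica when it is too high.
    SERVICE=SERVICE_NAME
    PROM=http://prometheus:9090
    CPU_LIMIT=200       # total CPU % across all replicas that triggers a scale-up
    MAX_REPLICAS=10

    QUERY="sum(rate(container_cpu_user_seconds_total{container_label_com_docker_swarm_service_name=~\"$SERVICE\",id=~\"/docker/.*\"}[1m])*100)"
    CPU=$(curl -sG "$PROM/api/v1/query" --data-urlencode "query=$QUERY" \
          | jq -r '.data.result[0].value[1] // "0"' | cut -d. -f1)

    # Count the replicas that are currently running
    REPLICAS=$(docker service ps "$SERVICE" | grep -c Running)

    if [ "$CPU" -gt "$CPU_LIMIT" ] && [ "$REPLICAS" -lt "$MAX_REPLICAS" ]; then
        docker service update --replicas $((REPLICAS + 1)) "$SERVICE"
    fi

Run it from a cron job (or as a small service on a manager node), and add the mirror-image check to scale back down when the load drops.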

Regards


Combining your ideas:

  • cadvisor: current container performance metrics
  • node-exporter: current node performance metrics
  • prometheus: historical data

Using the ServiceUpdate endpoint, which can be accessed via a TLS socket (ideal) or /var/run/docker.sock (insecure but simpler), I think one could create a service that checks Prometheus for cAdvisor/node-exporter data and, based on some logic, increases the number of replicas as needed.
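For example, a rough sketch of the socket route, with the service name as a placeholder and jq assumed to be available (the exact spec fields depend on your Engine API version):

    # Fetch the current service definition and its version index
    SPEC=$(curl -s --unix-socket /var/run/docker.sock http://localhost/services/SERVICE_NAME)
    VERSION=$(echo "$SPEC" | jq '.Version.Index')

    # Bump the replica count and post the modified spec back to the ServiceUpdate endpoint
    echo "$SPEC" | jq '.Spec | .Mode.Replicated.Replicas += 1' \
      | curl -s --unix-socket /var/run/docker.sock \
          -X POST -H "Content-Type: application/json" --data @- \
          "http://localhost/services/SERVICE_NAME/update?version=$VERSION"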

This is already done and supported in k8s, but it would be "cool" to have something like it in Swarm. I'm not really a big proponent of autoscaling unless it is a matter of dynamically provisioning another AWS instance and letting it register into the swarm, because you pay for the CPU/memory by time anyway, according to a comment on Server Fault.


You're right. I'm also thinking of giving K8s a try.
But on the other hand, Swarm has some things I like:

  • The integrated load balancer makes it simple to bring new replicas into rotation. You don't need a load balancer in front of all your services.
  • This solution allows us to scale services based on other parameters, like a RabbitMQ queue length (see the sketch below).
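For the queue-length case, a minimal sketch could poll the RabbitMQ management API like this (host, credentials, queue name and threshold are placeholders):

    # Messages waiting in QUEUE_NAME on the default vhost (%2F)
    MSGS=$(curl -s -u guest:guest "http://rabbitmq:15672/api/queues/%2F/QUEUE_NAME" | jq '.messages')
    REPLICAS=$(docker service ps WORKER_SERVICE | grep -c Running)

    if [ "$MSGS" -gt 1000 ]; then
        docker service update --replicas $((REPLICAS + 1)) WORKER_SERVICE
    fi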

For us, autoscaling means far fewer resources dedicated to machines that only work a few hours per week or even per month; now those resources can be shared and be available all the time.

Also, you can deploy new Docker nodes and add them to the swarm with one simple command, so this solution could also cover that.

I thought that too, but with container technology (unlike VM technology) the resources are not "reserved" unless you explicitly say so in the YML, using the resource reservation keys to set aside CPU and memory for a specific container. Otherwise it will only use what it really needs. Though if you scale to 1000 replicas and each container takes 100MB minimum, that's another story.
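For reference, an explicit reservation looks roughly like this in a v3 compose/stack file (the numbers are arbitrary examples):

    version: '3.7'
    services:
      web:
        image: nginx
        deploy:
          replicas: 3
          resources:
            reservations:        # capacity the scheduler sets aside for each replica
              cpus: '0.25'
              memory: 100M
            limits:              # hard cap per replica
              cpus: '0.50'
              memory: 200M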

There is also this project:

Although I haven't tried it yet.


Some people like it more because it is more mature and flexible. I still prefer Swarm if I have to keep things sustainable after I leave a project once I am done.

That's nice! But it looks like it only exposes an easy way to scale, not the logic behind it.

But I will try it.

Thanks

Also take a look at https://monitor.dockerflow.com/auto-scaling/
I'm using this approach, with the one change of using a piece of Go code similar to gianarb/orbiter to handle the actual docker scale command rather than Jenkins, which is what vfarcic is using.


I agree with @eldeberde.
In addition, it's easy to use AWS cloud services (such as an Auto Scaling Group).