Measurements of impact of combining RUN/LABEL commands?

I’ve read in multiple places the advice to combine RUN and LABEL commands (and others?) to minimize layering, for a “more efficient image”. That seems reasonable, but AUFS supposedly makes this kind of thing pretty efficient. Has anyone actually measured the efficiency difference and reported on it?

I think there are two practical reasons to want fewer layers:

  1. If you have a step that downloads things then tries to clean up after itself, those must be in the same RUN step (a standalone RUN rm ... layer is useless)
  2. There’s a limit of (IIRC) 127 layers, and if your setup is especially complicated you could bump into this

Occasionally it’s helpful to have more:

  1. If you’re actively developing a Dockerfile, layer caching can skip the first four steps of a six-step sequence if you have separate RUN commands, but not if they’re all squished together
  2. If you have very large (gigabyte-sized) layers, they can become unwieldy in a couple of ways; replacing a single giant COPY layer with several layers of merely a few hundred megabytes each can make docker push more reliable
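As a sketch of the caching point, assuming a hypothetical Python application with a requirements.txt: keeping the dependency install in its own step means edits to the application source reuse the cached install layer, while a single combined step would redo the install on every change.

```dockerfile
FROM python:3.12-slim

WORKDIR /app

# Copy only the dependency list first, so this layer's cache
# survives edits to the application source below.
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Changing anything under ./src invalidates the cache only from
# this COPY onward; the pip install layer above is reused.
COPY src/ ./src/

CMD ["python", "src/main.py"]
```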

I’d imagine the actual performance impact on disk I/O of the running container is negligible and have never thought to look.

dmaze (David Maze, https://forums.docker.com/users/dmaze) wrote on June 21:

> I think there are two practical reasons to want fewer layers:
>
>   1. If you have a step that downloads things then tries to clean up
>     after itself, those must be in the same RUN step (a standalone
>     RUN rm … layer is useless)
>   2. There’s a limit of (IIRC) 127 layers, and if your setup is
>     especially complicated you could bump into this
I could use some clarification of the first point.

Concerning the limit, I believe it’s 42 (someone with a sense of humor
there?), so it’s even worse than that.
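If you want to check how close an image is to whichever limit applies, `docker image inspect` can report the layer count directly (this assumes a local Docker daemon and an image that has already been pulled; nginx:latest is just an example):

```shell
# Count the filesystem layers in an image
docker image inspect --format '{{len .RootFS.Layers}}' nginx:latest
```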



The overall advantages of layering are pretty clear.

Let’s say your Dockerfile says

RUN apt-get update
RUN apt-get install -y nginx
RUN apt-get clean
RUN rm -rf /var/lib/apt/lists/*

Each RUN command makes a layer. The first two download things. Even though the last two layers delete the things the first two downloaded, the first two layers are still part of the final image, including all of the downloaded content. If instead you say

RUN apt-get update \
 && apt-get install -y nginx \
 && apt-get clean \
 && rm -rf /var/lib/apt/lists/*

then all of this happens in a single layer, and the intermediate package lists and .deb packages aren’t in the final image.
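To answer the original question empirically rather than by guesswork, you can build both variants and compare them: `docker history` shows the size each layer contributes, which makes the retained download layers visible. A sketch, assuming a local Docker daemon; the tags and Dockerfile names here are hypothetical:

```shell
# Build both variants under different tags (Dockerfile.split and
# Dockerfile.combined are placeholder names for the two versions above)
docker build -t nginx-split -f Dockerfile.split .
docker build -t nginx-combined -f Dockerfile.combined .

# Per-layer sizes; the split variant keeps the apt download
# layers in the final image even though later layers delete the files
docker history nginx-split
docker history nginx-combined

# Total image sizes side by side
docker images | grep 'nginx-'
```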