Layer caching doesn't seem to be working as expected (but how do I check)

timdiggins · November 6, 2023, 8:11am

Hi there

I have several base images which depend on each other (base1, base2, base3, base4) and then final images which depend on each of the bases (e.g. final1 depends on base1, final4 depends on base4). The dependencies are always by a specific tag which is created at that moment. Whenever I build, I also push the tag to the relevant repo (which are stored in AWS’s ECR container repository)

base4 has a 8G layer, so I really notice when I need to push this layer.

I have built and pushed a base4 version (a while ago). And now I’ve just built a final4 version. Many of its layers are cached but the 8G layer is not (I can tell because it takes ages to push this layer).

I was trying to work out why this is. But I’m not sure how to do this. I can see that I’ve pulled the relevant base4 image (because docker pull says it’s up to date). I can use docker history on base4 and final4 but I can’t see anything that can tell me the identity (hash?) of the layer.

Is there anything I can do to work out why the large layer is having to be pushed again? (or indeed work out if it is actually identical - it should be, but maybe I’ve made a mistake)

thanks in advance

Docker version 24.0.6, build ed223bc
MacOS 13.5.2 (22G91)
Docker desktop 4.24.2 (124339)

meyay · November 6, 2023, 7:04pm

An image consists of a manifest + image layers used in the manifest.

By convention, a registry should indicate that an image layer already exists and shouldn’t be pushed again. In your case this doesn’t seem to happen. This has nothing to do with caching or docker itself… unless of course there is a bug, which then high likely would happen with other registries as well.

When you build your final4 image on your host it will be stored in the host’s local image cache. If you push this image the first time, it will push the manifest + all image layers that don’t exist in the registry. If you push the same image (without deleting and rebuilding the image) later again, it will only push the manifest, but no image layers, as they already exist in the registry.

Now let’s assume you delete the image from the local image cache of your host, and clean the build cache and rebuild the image, just a moment after pushing the image, it will result in a new manifest and new image layers. Which, if pushed, will result in every image layer being uploaded again. An image build is repeatable, but not reproducible because timestamps of files or timestamps embedded into compiled binaries (and even the manifest) will be different.

If you build and push base4, then build final4 based on base4 and push final4, the push should detect that the image layers of base4 already exist and should not push them. If it does, consult the ECR documentation (you could be missing a relevant IAM policy), or consult AWS support.

timdiggins · November 8, 2023, 12:45pm

Thanks very much for your response.

Is there a way (through CLI) to check the identify of a specific layer (in one image) and compare it to the identity in another layer? (Like comparing the SHAs on two git commits /trees )

Similarly is there a way to see if a specific layer is cached (present) locally / or in a remote repository?

timdiggins · November 14, 2023, 5:46pm

FYI for future reference, it turns out this is a long-time missing feature (OMG) from ECR: [ecr] [request]: cache layers between repositories · Issue #531 · aws/containers-roadmap · GitHub

meyay · November 14, 2023, 6:02pm

Thank you for the update!

Topic		Replies	Views
Copies of layers appearing in push Docker Hub docker , build	0	1757	September 28, 2016
Docker daemon image layer reuse General	6	4444	June 5, 2017
Docker pushes always the hole base image to registry Feature Requests	1	1288	November 28, 2016
Docker Caching - do I restore my cache when I pull an image? Docker Hub	3	11980	April 26, 2023
Questions about behavior regarding images General	1	384	September 27, 2020

Layer caching doesn't seem to be working as expected (but how do I check)

Related topics