Hello,
I’m working on Windows 11 with Docker Desktop.
I have everything set up and working to run Docker images with CUDA; I can run a container that launches “nvidia-smi” successfully, so the NVIDIA drivers are available from a running container.
For example, this command works and returns all the CUDA info correctly:
docker run --gpus all nvidia/cuda:12.6.3-cudnn-devel-ubuntu24.04 nvidia-smi
=> NVIDIA-SMI 565.77.01 Driver Version: 566.36 CUDA Version: 12.7
But any BUILD step that requires nvidia-smi fails.
For example, putting this in the Dockerfile:
RUN nvidia-smi
=>
[9/9] RUN nvidia-smi:
0.479 /bin/sh: 1: nvidia-smi: not found
I need the nvidia-smi command at build time to build an executable with the cuda feature:
RUN cargo build --release --features cuda
The problem is that the nvidia-smi command is available at runtime (through the “--gpus all” option), but not at build time.
Is there any way to fix that?
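For context, a stripped-down Dockerfile along these lines reproduces the problem (the Rust toolchain install, paths and feature name are just placeholders, not my exact file):
FROM nvidia/cuda:12.6.3-cudnn-devel-ubuntu24.04
# Install a Rust toolchain (any install method works; rustup shown here)
RUN apt-get update && apt-get install -y curl build-essential && \
    curl -sSf https://sh.rustup.rs | sh -s -- -y
ENV PATH="/root/.cargo/bin:${PATH}"
WORKDIR /app
COPY . .
# Fails under buildx: nvidia-smi and the driver libraries are not mounted during "docker build"
RUN nvidia-smi
RUN cargo build --release --features cuda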
Since buildx is the default builder and doesn’t run actual Docker containers, GPU support at build time would have to be implemented in buildx itself. Some people have shared workarounds, but I have no idea whether they would work with Docker Desktop, or at all.
Thanks for the suggestion.
I ended up building an intermediate image that I launched with --gpus all, then I could compile using the NVIDIA driver inside the container, and finally committed the resulting container into a new image containing everything.
It’s a bit more work, not elegant, but it’s simple and it works.
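For anyone hitting the same issue, the steps look roughly like this (image and container names are just examples):
# 1. Build an intermediate image with the sources and toolchain, but without the cargo build step
docker build -t myapp-build .
# 2. Run it with GPU access so the NVIDIA driver and nvidia-smi are visible, and compile inside
docker run --gpus all --name myapp-builder myapp-build cargo build --release --features cuda
# 3. Commit the stopped container, with its build artifacts, into the final image
docker commit myapp-builder myapp:final
docker rm myapp-builder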
I really hope that one day Docker will be able to build with a --gpus all option.
Cheers
Cedric