How to build a vLLM Docker image for a different CUDA version

RunLLM · September 5, 2025, 2:28am

The official vLLM Dockerfile supports multiple CUDA versions by parameterizing the base image with the CUDA_VERSION build argument. A build succeeds only if a CUDA base image exists for the Ubuntu/CUDA combination you choose and the packages the Dockerfile installs (such as ccache) are available in that image's repositories. Your error (E: Unable to locate package ccache) means apt could not find the package in its package lists. That usually happens when the lists are missing or outdated, for example because a network problem interrupted the package list update, or because the chosen Ubuntu/CUDA image does not carry the package in its configured repositories. This is a limitation of the underlying image and network environment, not of vLLM’s Dockerfile design. See official vLLM Docker docs and related issues.
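As a rough illustration of passing CUDA_VERSION as a build argument, the sketch below only assembles a `docker build` command line and prints it, so you can inspect it before running anything. The Dockerfile path (`docker/Dockerfile`), the build target (`vllm-openai`), the image tag, and the example version `12.4.1` are assumptions for illustration. Check them against your vLLM checkout and against the CUDA base image tags that actually exist.

```python
import shlex


def build_command(cuda_version, tag="vllm-custom",
                  dockerfile="docker/Dockerfile", target="vllm-openai"):
    """Assemble a `docker build` command that overrides CUDA_VERSION.

    All defaults are assumptions for illustration: verify the Dockerfile
    path and target name in your own vLLM source tree.
    """
    return [
        "docker", "build", ".",
        "--file", dockerfile,        # assumed location of the Dockerfile
        "--target", target,          # assumed build stage name
        "--build-arg", f"CUDA_VERSION={cuda_version}",
        "--tag", f"{tag}:cuda{cuda_version}",  # hypothetical local tag
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so it can be reviewed first.
    print(shlex.join(build_command("12.4.1")))
```

Run the printed command from the root of the vLLM repository. If the build still fails on ccache, the base image selected by that CUDA version is the first thing to check.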

Would you like more detail on troubleshooting this specific error or on multi-CUDA support in vLLM Dockerfiles?
