Why using devel image to build final vLLM docker image?

RunLLM · August 17, 2025, 8:50pm

You are correct: vLLM and its dependencies (like torch.compile, Triton, and FlashInfer) require JIT compilation, which means development tools such as nvcc and gcc must be present at runtime for kernel compilation and auto-tuning. This is why the “devel” image is used, as confirmed in this discussion and this GitHub issue. While a slimmer Dockerfile is possible, removing these tools would break JIT compilation unless all kernels are precompiled for every possible hardware configuration, which is not practical.

Would you like more detail on multi-stage builds or community efforts to slim down the image?

Sources:

Topic		Replies	Views
Current vLLM docker image size is 12.64Gb, how to reduce it? General	3	195	July 28, 2025
How to get the dev version vllm docker image? General	2	48	April 8, 2025
How to build vllm docker image for different CUDA version General	1	105	September 5, 2025
Build VLLM without installing gcc? General	1	317	June 20, 2025
vLLM docker image For MAC CPU Silicon General	21	327	September 19, 2025

Why using devel image to build final vLLM docker image?

Related topics