It depends on how the image was built. A vLLM wheel compiled from source inside a Docker image is only compatible with the GPU architectures listed in torch_cuda_arch_list at build time. By default, the official vLLM Dockerfile builds for all supported GPU architectures for widest distribution, so the resulting wheel is portable across GPU types. However, if you pass --build-arg torch_cuda_arch_list="" when building the image, vLLM detects the GPU present on the build machine and compiles only for that architecture, producing a wheel that is not portable to other GPU types. To guarantee compatibility with specific GPUs, explicitly list all desired architectures during the build, as described in the documentation and community discussions (docs, discussion); see the example build command below.
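As a minimal sketch, assuming the official vLLM Dockerfile and its vllm-openai target, a multi-architecture build might look like this; the architecture list (Volta through Hopper) is illustrative and should match the GPUs you actually plan to run on:

```bash
# Build the vLLM OpenAI-compatible server image for several GPU
# architectures at once. The compute capabilities listed here
# (7.0=V100, 7.5=T4, 8.0=A100, 8.6=A10/RTX 30xx, 8.9=L4/RTX 40xx,
# 9.0=H100) are an illustrative example, not a required set.
DOCKER_BUILDKIT=1 docker build . \
  --target vllm-openai \
  --tag vllm/vllm-openai:multi-arch \
  --build-arg torch_cuda_arch_list="7.0 7.5 8.0 8.6 8.9 9.0+PTX"
```

Appending +PTX to the newest architecture additionally embeds PTX intermediate code, so the wheel can be JIT-compiled for GPUs newer than those listed, at the cost of slower first startup. Note that each extra architecture increases both build time and image size.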
Would you like more detail on choosing torch_cuda_arch_list values for specific GPU models?