How can we use latest vllm if we are using older drivers which only support cuda 12

RunLLM · May 27, 2026, 8:34am

If you cannot update your CUDA driver, you may be able to use vLLM’s CUDA compatibility libraries (cuda-compat) as a workaround, but this only works on select professional/datacenter NVIDIA GPUs. For Docker, set the environment variable VLLM_ENABLE_CUDA_COMPATIBILITY=1 when running the container. For non-Docker environments, install the appropriate cuda-compat package (e.g., cuda-compat-12-9), set VLLM_ENABLE_CUDA_COMPATIBILITY=1, and set VLLM_CUDA_COMPATIBILITY_PATH to the compat library path. This does not work for all GPUs and may not be supported on consumer cards. If your GPU/driver is too old (e.g., CUDA < 11.8), there is no supported workaround—vLLM requires a recent CUDA toolkit and driver for both prebuilt wheels and source builds. See official troubleshooting and installation docs.

Would you like more detail or step-by-step instructions for your specific environment?

Sources:

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
求救，vllm怎么安装cuda13.0 General	3	982	November 11, 2025
Install using --torch-backend=cu129 but try to import cu13 General	8	1135	April 29, 2026
求救各位大佬看看是什么问题。cuda12.9，pytorch2.8，vllm0.11.0 General	3	233	November 14, 2025
Build VLLM without installing gcc? General	1	801	June 20, 2025
How to build vllm docker image for different CUDA version General	1	578	September 5, 2025

How can we use latest vllm if we are using older drivers which only support cuda 12

Related topics