Do RTX 5090 and RTX PRO 5000 have some differencies which should be taken into account

RunLLM · March 19, 2026, 12:32pm

To debug this issue, first ensure both systems have identical NVIDIA driver and CUDA versions (which you confirmed: 580.126.09, CUDA 13.0). Next, compare the output of nvidia-smi for both cards, focusing on the “CUDA Version,” “Driver Version,” and “Persistence-M” settings. Persistence mode differences are generally not the cause of kernel launch failures, but you can set it to “On” for both with nvidia-smi -pm 1 for consistency (NVIDIA docs).

For step-by-step debugging:

Set CUDA_LAUNCH_BLOCKING=1 in your environment before running vLLM to get accurate error traces.
Run a minimal CUDA program (e.g., python -c "import torch; print(torch.cuda.is_available())") on both systems to confirm CUDA is working.
Check that vLLM and all dependencies (PyTorch, CUDA libraries) are built for the correct compute capability (12.0 for Blackwell/RTX 5090/PRO 5000).
If the error persists only on the RTX PRO 5000, try running with a different model or with --enforce-eager to disable CUDA graphs, as some kernels may not be compatible.
Review the vLLM troubleshooting guide for additional environment variables and debugging tips.

Would you like a sample script or more details on any of these steps?

Sources:

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
Docker image `vllm/vllm-openai:v0.9.0` doesn't work on 5090 General	3	918	June 10, 2025
RTX 5090 + GLM incompatible issues - Please update General	2	611	January 4, 2026
vLLM on RTX5090: Working GPU setup with torch 2.9.0 cu128 NVIDIA GPU Support	18	6390	January 13, 2026
RTX PRO 6000 users seek help, LLAMA 4 NVFP4 NVIDIA GPU Support	1	300	November 25, 2025
CUDA error: no kernel image is available for execution on the device General	1	1311	August 28, 2025

Do RTX 5090 and RTX PRO 5000 have some differencies which should be taken into account

Related topics