VGPU on podman "No CUDA GPUs are available"

jerome · May 23, 2026, 6:19am

I have an NVIDIA L40S with:

Driver 580.95.05 + CUDA 13.0
nvidia-smi works perfectly
Torch sees the GPU without any problems (torch.cuda.is_available() == True)

However, all modern inference engines fail with the error:

RuntimeError: No CUDA GPUs are available
This error only occurs in subprocesses (vLLM’s EngineCore)
Tested:

vLLM 0.21.0:

VLLM_USE_V1=0
VLLM_WORKER_MULTIPROC_METHOD=spawn and fork
All possible NVML bypasses (VLLM_DISABLE_PYNVML, PYTORCH_NVML_DISABLE, etc.)
Clean reinstall of PyTorch cu130 + vLLM

NVML / pynvml :
nvidia-ml-py reinstalled
Driver reloaded (rmmod + modprobe nvidia)
Torch works, but pynvml.nvmlDeviceGetHandleByIndex() returns NVMLError_Unknown

Environment

OS: RHEL-9
Podman version 5.4.0
Template: DeepSeek-Coder-V2-Lite-Base
GPU: NVIDIA L40S 48GB

Question
This problem seems related to an NVML incompatibility with the 580 driver + CUDA 13.0 in child processes (multiprocessing/spawn).

Has anyone else encountered this problem on an L40S or with the 580 driver?

Thanks in advance!

Topic		Replies	Views
Why is this not working? I corrected it but still NVIDIA GPU Support	1	905	May 8, 2025
求救各位大佬看看是什么问题。cuda12.9，pytorch2.8，vllm0.11.0 General	3	230	November 14, 2025
How can we use latest vllm if we are using older drivers which only support cuda 12 General	3	10	May 27, 2026
RTX PRO 6000 users seek help, LLAMA 4 NVFP4 NVIDIA GPU Support	1	293	November 25, 2025
Can anyone help me? Why is this not working? It used 😭 NVIDIA GPU Support	1	1200	May 8, 2025

VGPU on podman "No CUDA GPUs are available"

Related topics