[aiter] import [module_aiter_enum] under /usr/local/lib/python3.12/dist-packages/aiter/jit/module_aiter_enum.so
vLLM Docker runs can hang at `[aiter] import [module_aiter_enum]` when the installed aiter package or its JIT-compiled modules are incompatible with your GPU architecture, or when the Docker image was built without support for your specific GPU (e.g., your GPU's architecture is missing from the build flags). This is especially common on AMD GPUs when the image does not include the correct PYTORCH_ROCM_ARCH for your hardware, causing the aiter JIT to hang during GPU enumeration or module import, as described in the root-cause analysis of ROCm images and aiter integration in vllm-project/vllm issue #28052 ("[Bug]: rocm/vllm:rocm7.0.0_vllm_0.11.1_20251103 has an error with flash attention").
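Before rebuilding anything, it helps to confirm the mismatch. A minimal diagnostic sketch, assuming a ROCm host (rocminfo ships with the standard ROCm tooling; the 120-second timeout is an arbitrary cutoff chosen to surface a hang rather than block the shell):

```shell
# List the GPU architectures ROCm actually sees on this host.
if command -v rocminfo >/dev/null 2>&1; then
  rocminfo | grep -o 'gfx[0-9a-f]*' | sort -u
else
  echo "rocminfo not found: not a ROCm host or ROCm tools not on PATH"
fi

# Try importing aiter under a timeout so a hang is detected instead of
# stalling indefinitely, which is the symptom described above.
timeout 120 python3 -c "import aiter; print('aiter import OK')" \
  || echo "aiter import failed or timed out"
```

If the gfx target printed by rocminfo does not appear in the architectures the image was built for, that points to the PYTORCH_ROCM_ARCH mismatch described above.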
To resolve this, rebuild vLLM and aiter from source with the correct GPU architecture (e.g., set PYTORCH_ROCM_ARCH="gfx1103" for the AMD Radeon 780M), or use a Docker image that explicitly includes your GPU's architecture. If you are not on ROCm/AMD hardware, verify instead that your CUDA or ROCm drivers and all dependencies are installed and compatible with the vLLM Docker image (see vllm-project/vllm issue #28052).
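A rebuild along those lines could be sketched as follows. Assumptions to verify against your own vLLM checkout: that the ROCm Dockerfile lives at docker/Dockerfile.rocm and accepts a PYTORCH_ROCM_ARCH build argument, and that gfx1103 (the Radeon 780M example from the text) is the target you need:

```shell
# Target architecture for the rebuild; gfx1103 is the Radeon 780M
# example used in the text above -- substitute your own gfx target.
ARCH="gfx1103"

if command -v docker >/dev/null 2>&1; then
  # Fetch the vLLM sources and build the ROCm image for that arch.
  # The Dockerfile path and build-arg name are assumptions; check the
  # Dockerfile in your checkout before relying on them.
  git clone https://github.com/vllm-project/vllm.git
  cd vllm
  docker build -f docker/Dockerfile.rocm \
    --build-arg PYTORCH_ROCM_ARCH="$ARCH" \
    -t "vllm-rocm-$ARCH" .
else
  echo "docker not found: install Docker before rebuilding the image"
fi
```

Building for the exact gfx target ensures the aiter JIT modules compiled into the image match the hardware the container later enumerates, which is the condition the hang depends on.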