vLLM install for 5090

Where can I find step-by-step instructions for installing vLLM on an RTX 5090 running Ubuntu 24.04?

Step-by-step instructions for installing vLLM on an RTX 5090 under Ubuntu 24.04 are currently scattered across community discussions and tutorials. The recommended approach is to build vLLM from source against CUDA 12.8 and a recent PyTorch nightly, since official wheels and Docker images may not yet support the 5090 (Blackwell, compute capability 12.0) out of the box. A community guide walks through environment setup, building from source, and running vLLM on the 5090; see the sketch below. You may also need to update NCCL to version 2.26.5 or newer for multi-GPU support, as discussed in a related issue thread.
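A minimal sketch of the from-source build, assuming a fresh virtual environment, a driver stack that supports CUDA 12.8, and PyTorch's cu128 nightly index. The index URL, the requirements file path, and the `use_existing_torch.py` step are taken from vLLM's build-from-source docs but can shift between releases, so verify them against the current repo before running:

```bash
# Assumes: NVIDIA driver with CUDA 12.8 support is already installed.
python3 -m venv ~/vllm-env && source ~/vllm-env/bin/activate

# PyTorch nightly built against CUDA 12.8 (index URL may change; check first).
pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu128

# Newer NCCL for multi-GPU support on Blackwell (version floor from the issue thread).
pip install -U "nvidia-nccl-cu12>=2.26.5"

# Build vLLM from source against the torch installed above.
git clone https://github.com/vllm-project/vllm.git
cd vllm
python use_existing_torch.py            # reuse the nightly torch instead of the pinned version
pip install -r requirements/build.txt   # path may differ in older vLLM versions
# Target Blackwell (RTX 5090, compute capability 12.0) explicitly.
TORCH_CUDA_ARCH_LIST="12.0" pip install -e . --no-build-isolation
```

As a quick sanity check after the build, `python -c "import torch; print(torch.cuda.get_device_capability())"` should report `(12, 0)` on a 5090, confirming the extension was compiled for the right architecture.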

If you prefer Docker, you can follow the build instructions and optimization flags shared in an issue comment, which cover building a compatible image and running vLLM with the correct CUDA architecture flags for Blackwell GPUs; a sketch follows. Official 5090 support is improving, but for now building from source or using a custom Docker build is necessary.
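A hedged sketch of the custom Docker build, assuming vLLM's in-tree Dockerfile with its `torch_cuda_arch_list` build argument and `vllm-openai` target (argument names, targets, and the Dockerfile location can differ across versions, and the model name below is only an example):

```bash
# Build from the vLLM repo root, targeting Blackwell (compute capability 12.0).
# The build-arg name comes from vLLM's Dockerfile and may change between releases.
DOCKER_BUILDKIT=1 docker build . \
  --file docker/Dockerfile \
  --target vllm-openai \
  --build-arg torch_cuda_arch_list="12.0" \
  --tag vllm-blackwell:local

# Run the OpenAI-compatible server on the 5090.
docker run --gpus all -p 8000:8000 vllm-blackwell:local \
  --model Qwen/Qwen2.5-7B-Instruct
```

Note that `--gpus all` requires the NVIDIA Container Toolkit to be installed on the host.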

Would you like a step-by-step breakdown of the process?
