What should be /dev/shm size for larger models

I’m seeing large models (20B+) crash during startup (loading/compilation) when /dev/shm is limited to 8GiB, but they succeed with 16GiB. Does vLLM initialization require more than 8GiB of /dev/shm for such models even when tensor_parallel_size=1, and what size do you recommend? Also, does it even matter?

Came looking for the same. Trying to deploy Qwen3 235B in k8s (using 4x 80GB H100 GPUs). It keeps failing with a 20GB shm size.

Currently trying 60GB, and was looking for guidance on whether the shm size could be the issue.

A too-small /dev/shm (shared memory) can cause vLLM to fail when deploying large models like Qwen3 235B, especially with tensor parallelism across multiple GPUs, since the tensor-parallel worker processes communicate through shared memory. For large models and multi-GPU setups, set a generous /dev/shm size in your Kubernetes pod spec using an emptyDir volume with medium: Memory and an appropriate sizeLimit. 20GB is often insufficient for a 235B model on 4x 80GB H100s, so 60GB or more is a reasonable starting point. See the official Kubernetes deployment example and multi-node distributed serving docs for reference YAMLs.
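A minimal pod-spec sketch of that emptyDir approach (the pod name, image tag, model name, and sizes here are illustrative, not taken from the official example):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: vllm-qwen3
spec:
  containers:
    - name: vllm
      image: vllm/vllm-openai:latest
      args: ["--model", "Qwen/Qwen3-235B-A22B", "--tensor-parallel-size", "4"]
      resources:
        limits:
          nvidia.com/gpu: 4
      volumeMounts:
        # Mount the memory-backed volume over the container's default
        # (tiny, typically 64MB) /dev/shm
        - name: shm
          mountPath: /dev/shm
  volumes:
    - name: shm
      emptyDir:
        medium: Memory   # tmpfs backed by node RAM
        sizeLimit: 60Gi  # counts against the pod's memory limit
```

Note that a memory-backed emptyDir is charged to the pod's memory accounting, so the node (and any memory limit you set) must have headroom for the sizeLimit on top of the model's host-RAM usage.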

If you still encounter issues after increasing /dev/shm, also check that your pod's GPU requests/limits match the tensor_parallel_size you pass to vLLM, and that the node has enough free RAM to back the shared-memory allocation. Would you like a sample YAML or more troubleshooting tips?
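One quick sanity check: the mounted shm size is visible from inside the container, so you can confirm the emptyDir volume actually took effect (assuming a Linux container; reach it with something like kubectl exec <pod> -- sh):

```shell
# Confirm the /dev/shm mount reflects the sizeLimit you configured;
# the "Size" column should show your tmpfs size, not the 64M default.
df -h /dev/shm
```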
