|
About the NVIDIA GPU Support category
|
|
0
|
117
|
March 20, 2025
|
|
vLLM hangs during worker initialization on Blackwell PCIe GPUs unless --disable-custom-all-reduce is used
|
|
1
|
248
|
April 11, 2026
|
|
# SM120 (RTX PRO 6000) NVFP4 MoE Performance Report -- Qwen3.5-397B
|
|
1
|
577
|
April 11, 2026
|
|
Vllm启动时,日志卡在nccl相关部分,不继续往下
|
|
16
|
1458
|
April 8, 2026
|
|
SM120 (RTX PRO 4000): 6.5x throughput gain and v0.18.1 regression findings
|
|
1
|
429
|
April 3, 2026
|
|
MoE config on GH200
|
|
9
|
453
|
February 4, 2026
|
|
vLLM on RTX5090: Working GPU setup with torch 2.9.0 cu128
|
|
18
|
5973
|
January 13, 2026
|
|
Support for RTX 6000 Blackwell 96GB card
|
|
5
|
6178
|
January 5, 2026
|
|
How to apply FA4 on B200?
|
|
3
|
495
|
December 18, 2025
|
|
RTX PRO 6000 users seek help, LLAMA 4 NVFP4
|
|
1
|
285
|
November 25, 2025
|
|
RuntimeError: Int8 not supported on SM120. Use FP8 quantization instead, or run on older arch (SM < 100)
|
|
3
|
190
|
November 27, 2025
|
|
Need help compiling and running on Jetson Thor
|
|
4
|
813
|
November 1, 2025
|
|
RTX Pro 6000 Tensor Parallelism CUBLAS_STATUS_ALLOC_FAILED
|
|
3
|
420
|
September 13, 2025
|
|
vLLM Benchmarking: Why Is GPUDirect RDMA Not Outperforming Standard RDMA in a Pipeline-Parallel Setup?
|
|
1
|
538
|
August 14, 2025
|
|
GPU Time Slicing
|
|
0
|
195
|
July 16, 2025
|
|
KV Cache quantizing?
|
|
3
|
1038
|
June 2, 2025
|
|
Struggling with my dual GPU setup. And getting chat template errors
|
|
2
|
251
|
May 30, 2025
|
|
Why is this not working? I corrected it but still
|
|
1
|
898
|
May 8, 2025
|
|
Can anyone help me? Why is this not working? It used 😭
|
|
1
|
1181
|
May 8, 2025
|
|
Docker explosion this morning after it worked fine for a long while
|
|
6
|
500
|
May 6, 2025
|
|
32GB vs 48GB vRam
|
|
1
|
1125
|
May 3, 2025
|
|
Run on B200/5090 without building from source?
|
|
1
|
263
|
May 1, 2025
|
|
Making best use of varying GPU generations
|
|
2
|
890
|
April 11, 2025
|
|
Jetson orin, CUDA error: no kernel image is available for execution on the device
|
|
0
|
494
|
March 29, 2025
|