vLLM Forums
Dp8ep8 执行模型时没有回显,原因是卡在sampler
General
yangshanjun
November 20, 2025, 6:04am
3
@RunLLM
具体是卡在sample里的这一行了,这一行只是一个数据类型转换,应该不会卡吧
image
1116×957 129 KB
show post in topic
Related topics
Topic
Replies
Views
Activity
FlashMLA issue when running FP8 Deepseek V8 model on H20
General
3
153
September 9, 2025
Why is it so slow to build a odeVLLM from source using Docker?
General
39
110
January 17, 2026
RuntimeError: Int8 not supported on SM120. Use FP8 quantization instead, or run on older arch (SM < 100)
NVIDIA GPU Support
3
156
November 27, 2025
Pp8并行,update_from_output 会等所有rank的 model_executor.execute_model 执行完了之后才会执行吗
General
84
316
January 8, 2026
Vllm中,deepseek的模型 刷新kvcache的地方在哪
General
44
115
March 30, 2026