vLLM Forums
Pp8并行,update_from_output 会等所有rank的 model_executor.execute_model 执行完了之后才会执行吗
General
yangshanjun
December 11, 2025, 8:08am
21
@RunLLM
image
629×102 10.4 KB
看代码,不是把未调度的请求,从 input_batch中删除掉了吗
← previous page
Related topics
Topic
Replies
Views
Activity
Scheduler in vllm
Features
1
252
June 26, 2025
VLLM V1 Scheduler: Inconsistent Request Scheduling Under Token Budget Limit
General
25
120
December 17, 2025
从 cpu视角看,这个地方的 self.model_runner.execute_model 是不是立马就返回了
General
3
14
November 27, 2025
Question about schedule with V1 pipeline parallelsim
General
5
84
July 17, 2025
Dp 8启动,没有使能 --enable_expert_parallel,为什么还有 ep rank的打印
General
1
14
November 24, 2025